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This  dissertation  proposes  an  alternative  to  normal  (large  sample) 
approximation  for  the  tail  probability  approximation  (under  both  null  and  alternative 
hypotheses)  for  one  and  two  sample  U-statistics  by  saddlepoint  methods. 

Under  minimal  assumptions,  approximations  for  the  cumulant  generating 
function  of  both  one  and  two  sample  standardized  U-statistics  are  derived  by  using 
Hoeffding’s  decomposition.  These  approximations  are  then  used  in  Daniel’s 
formulation  (for  a discrete  case)  of  Lugannani  and  Rice’s  saddlepoint  approximation 
to  the  tail  probability. 

This  method  has  been  applied  to  both  the  Wilcoxon  Signed  Rank  test  and  the 
Mann- Whitney  form  of  Wilcoxon’s  Rank  Sum  test  to  approximate  the  tail  probability 
under  null  and  various  shift  normal,  logistic  and  Cauchy)  alternatives.  Finally,  results 
using  our  approximation  have  been  compared  to  both  normal  approximation  and 
exact  distribution  (based  on  10  million  simulations)  for  both  the  examples. 
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CHAPTER  1 
INTRODUCTION 

1.1  Review 

P.  R.  Halmos  (1946)  is  credited  with  first  creating  interest  in  the  existence  of 
U-statistics.  Wassily  Hoeffding  (1948)  formally  introduced  them  in  his  Annals  of 
Mathematical  Statistics  paper.  Since  that  time,  many  articles,  along  with  quite  a few 
books,  have  extended  the  theory  and  introduced  new  applications  associated  with  U- 
statistics.  The  class  of  U-statistics  is  important  for  the  following  reasons: 

• A number  of  commonly  used  statistics  are,  in  fact,  U-statistics. 

• The  simple  but  unified  structure  of  U-statistics  makes  them  ideal  for  studying 
general  inference  processes  and  for  generalizing  those  parts  of  asymptotic  theory 
concerned  with  the  behavior  of  sequences  of  sample  means. 

• Application  of  the  theory  often  generates  new  statistics  useful  in  practical 
inference  problems. 

• Minimal  assumptions  regarding  the  underlying  distributions  are  needed. 

Even  with  all  this  attention  focused  on  U-statistics,  it  has  traditionally  been 
hard  to  handle  quite  a few  U-statistics  based  tests  which  have  immense  potential  for 
numerous  applications.  The  main  difficulty  has  been  the  conspicuous  absence  of 
exact  methods  in  general  for  the  whole  class. 


1 


2 


1.2  Problem 

Methods  to  estimate  the  tail  probabilities  (under  both  null  and  alternatives) 
exist  for  only  a few  U-statistics.  Even  for  those  U-statistics  for  which  exact  methods 
do  exist,  calculation  of  tail  probabilities  can  be  fairly  time  consuming,  even  for 
moderate  sample  sizes.  Moreover,  even  if  they  exist,  they  may  not  be  as  well  known 
as  one  might  expect. 

Normal  approximations  based  on  asymptotic  distributions  give  an  alternative 
approach  for  approximating  tail  probability,  but  are  not  very  good  if  either  the 
sample  size  is  not  large  enough  or  the  underlying  distribution  is  heavy  tailed.  For 
power  calculation,  even  for  normal  shift  alternatives  in  one  and  two  sample  settings, 
if  A (shift  parameter)  is  not  small  enough,  it  is  quite  possible  that  normal  (large 
sample)  approximation  may  not  be  very  good  (Lehmann,  1975). 

Simulation  at  times  can  solve  the  problem  of  estimating  the  tail  probability 
for  the  U-statistics;  however,  it  can  obviously  be  influenced  by  the  accuracy  of 
random  number  generators.  As  a result  of  random  fluctuations,  there  is  always  a 
chance  of  coming  up  with  strange  results  for  a tail  probability. 

The  main  goal  of  this  research  was  to  come  up  with  a tail  probability 
approximation  via  saddlepoint  methods  for  the  general  class  of  U-statistics,  one 
sample  degree  K and  two  sample  degree  (Kj,K2),  which  could  be  used  under  both 
null  and  alternative  hypotheses.  Numerical  examples  (both  in  one  and  two  sample 
settings)  show  that  the  proposed  method  is  better  than  normal  approximation  and 
other  methods  such  as  Edgeworth  expansion. 
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Lugannani  and  Rice  (1980)  derived  a saddlepoint  approximation  for  sum  of 
n independent  and  identically  distributed  random  variables.  According  to  I.  M. 
Skovgaard  (1987),  the  approximation  can  be  used  for  an  arbitrary  sample  size  n,  but 
they  note  that  it  is  applicable  for  n = 1.  We  decided  to  apply  Lugannani  and  Rice’s 
(1980)  saddlepoint  approximation  for  the  distribution  function  under  a discrete 
distribution  for  the  standardized  U-statistics,  i.e.,  n = 1.  H.  E.  Daniels  (1987) 
derives  a saddlepoint  approximation  for  the  distribution  function  under  the  discrete 

case  when  x > E { x ),  i.e.,  observed  value  of  the  average  is  greater  than  the 

expected.  By  applying  Daniel’s  approximation  we  got  a saddlepoint  approximation 

for  the  distribution  function  for  a discrete  random  variable  for  ^ E { x ),  i.e., 

observed  is  less  than  the  expected,  and  also  combined  these  two  results  in  a more 
general  formulation  for  the  tail  (right  or  left)  probability  approximation. 

To  use  this  saddlepoint  approximation  result  for  the  distribution  function  of 
any  standardized  U-statistics,  we  had  to  approximate  the  cumulant  generating 
function  of  both  one  sample  (degree  K)  and  two  sample  (degree  Kj  and  K2)  U- 
statistics  in  a general  framework  by  using  Hoeffding’s  decomposition  of  the  U- 
statistics  into  several  uncorrelated  U-statistics. 

In  approximating  the  cumulant  generating  function,  we  tried  to  keep  the  order 
minimal  in  view  of  to  approximate  the  tail  probability  by  the  saddlepoint 
methodology  (as  suggested  by  I.  M.  Skovgaard,  1987).  This  approach  enabled  us  to 
propose  an  alternative  to  the  method  of  approximation  for  the  tail  probability  of  a 
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standardized  U-statistic.  The  proposed  technique  has  been  applied  under  null  and 
various  shift  alternatives  for  normal,  logistic  and  Cauchy  distributions  for  examples 
in  one  and  two  sample  settings. 

Comparisons  through  our  numerical  example  were  also  carried  out  in  the  one 
sample  (degree  2)  case  between  the  method  of  Edgeworth  expansion  as  suggested 
by  P.  J.  Bickel,  F.  Gotze  and  W.  R.  Van  Zwet  (1986)  and  the  saddlepoint 
approximation  proposed  above  for  the  tail  probability.  Our  examples  show  that  tail 
probability  approximation  did  better  than  the  Edgeworth  expansion. 

The  density  of  general  statistics  was  also  estimated  using  the  saddlepoint 
approximation  by  George  S.  Easton  and  Elvizio  Ronchetti  (1986).  They 
approximated  the  first  four  cumulants  individually  and  then  used  these  to 
approximate  the  cumulant  generating  function.  Using  the  G.  S.  Easton  and  E. 
Ronchetti’s  approach,  we  tried  to  approximate  the  cumulant  generating  function  by 
first  four  exact  cumulants  and  used  it  to  approximate  the  tail  probability.  We  then 
compared  the  result’s  of  their  tail  probability  approximation  with  our  saddlepoint 
approximation  using  an  approximated  cumulant  generating  function  by  applying 
Hoeffding’s  decomposition  of  U-statistics  as  derived  in  this  dissertation.  Our 
numerical  examples  show  that  our  method,  again,  consistently  did  better  over  the 
whole  range  of  critical  regions  for  both  one  and  two  sample  problems  under  null  and 
various  (normal,  logistic  and  Cauchy)  alternatives. 

In  our  examples  saddlepoint  approximation  is  closer  to  exact  tail  probabilities 
compared  to  normal  (large  sample)  approximation.  Improvement  in  the 
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approximation  for  these  examples  is  fairly  substantial  by  using  saddlepoint 
approximation.  Computing  time  is  negligible  since  we  are  using  a formula,  and 
obviously,  it  is  much  less  subjective  compared  to  simulation.  Thus,  reproduction  of 
results  is  not  a problem.  As  a result  we  try  to  show  in  this  dissertation  that  it  will 
provide  an  alternative  to  normal  approximation. 


CHAPTER  2 

SADDLEPOINT  APPROXIMATIONS 


Saddlepoint  approximations  were  introduced  by  H.  E.  Daniels  (1954)  in 
Annals  of  Mathematical  Statistics.  Based  on  the  theory  of  complex  functions,  Daniels 
(1954)  addressed  the  problem  of  existence  and  uniqueness  of  a saddlepoint.  Let  X 
be  a random  variable  with  probability  function,  f(x).  Then  its  moment  generating 
function  is  defined  by 

W(  r)  = /_■■«’■'  f(x)d(x-) 

By  the  inversion  formula  for  characteristic  functions  we  have 
f(x-)  = f-  f'M(it)  dt 

lit  — 

= J_  t-exm-u--  a, 

271 

= — dr.  (2.1) 

where  K(T)  is  the  cumulant  generating  function  of  X.  Since  the  density  function  is 
expressed  in  terms  of  K(T)  - Tx*  in  the  above  inversion  formula,  we  need  to  study 
the  exponent  of  the  integrand  carefully.  Observe  that  the  exponent  is  minimized  by 
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T that  satisfies  the  following  equation, 

K'{T)  = X*  . (2-2) 

Existence  of  real  roots  for  (2.2)  has  been  established  under  mild  and  quite  general 
conditions  by  H.  E.  Daniels  (1954).  Daniels  also  addressed  the  uniqueness  of  the 
saddlepoint  without  proof.  A detailed  proof  is  given  in  lemma  2.1  and  the  paragraph 
following  this  lemma. 

Lemma  2.1:  Convexity  of  cumulant  generating  function. 

Proof:  We  use  Holder’s  inequality  to  prove  this  result,  i.e., 

1 1 

E \ X Y \ £ { E \ X \P  y { E \ Y 

where  P > 1 and  — + — = 1. 

P Q 

For  a e (0,1)  we  have 

(oc-fj  + ( 1 - a yt^) 

E 6^“^'  + ( i-«  )-«2 ) •* 

= £:[(«'*'')“(  )’-“  ] 

choosing  p = — and  q = ^ and  applying  Holder’s  inequality,  we  have 

a 1-a 


= [ M ( f j ) ] “ . I M ( ) 1 


1-0 
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and  by  taking  natural  logarithm  on  both  sides  we  get 

K (a^t^  + ( 1-a  ) f2  ) ^ a A"  ( tj  ) + ( 1-a  ) • A-  ( ) 

and  hence  this  completes  the  proof  of  the  lemma. 

The  convexity  of  cumulant  generating  function  gives  us  the  uniqueness  of  the 
minimum  of  K(T)  - Tx*  attained  at  a point  Tq  along  the  real  line.  Also  note,  K(T)  - 
Tx*  is  an  analytic  function  of  T(T=x  + iy)  and  thus  by  Cauchy-Riemann  theorem 
for  analytic  functions  if 

K(T)  - Tx*  = U(  x*,y  ) + iV(  x* ,y  ) 

then, 

^ _c^  = (2.3) 

dx^  dxdy  dy^  ' 


dx  dy 

dU  ^ 

dy  dx 


Also, 


I = gl/  I I 


I gX(7)  - 7i  I ^ 

Along  the  real  line  since  K(T)  - Tx*  is  real,  i.e.,  V = 0,  so  that  we  have 


dx 


= 0 - K'(Tq)  - X*  = 0 


A=t„ 


and  if 
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^ dx^ 


= K"{J^  > 0 

/jr=To 


then,  a minimum  is  attained  at  Tg  along  the  real  line.  Since  F(T)  = K(T)  - Tx  is  an 
analytic  function,  the  first  derivative  is  zero  along  any  direction  at  Tg  and  as  a result 


It 

o 

II 

du 

idV  ) 

=0 

^ dy  ( 

y=0 

dy 

dy  J 

y*0 

dy 


0 and 


and  also  from  eq.  (2.3)  we  have. 


< 0. 


x=T^ 

y.Q 


Therefore,  a maximum  for  U is  attained  along  the  imaginary  axis  and  hence  Tg,  the 
solution  of  equation  (2.2),  is  a unique  saddlepoint  for  U. 

Let  the  random  variable  X take  values  on  a lattice  with  span  = 1 and  with 
probability  function  P(x  = r)  = p,.  Thus,  if  M(T)  is  the  moment  generating  function 
and  k(T)  is  the  cumulant  generating  function,  then  we  have 

M(T)  = exp  k(T)  = p,  exp  (T-r). 

The  probability  P(S  = s)  = p„  (s)  for  the  sum  S = X Xj  is  given  by 
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Pn(^) 


J_  n-^nk(T)-T. 

2ni  J -«  » 


Thus  the  tail  probability  when  s > n E(x)  is 


Qn.=  T,  Pn(^)  = — r*''  ^ 


(2.3) 


where  c > 0. 

Proceeding  from  eq.  (2.3)  H.  E.  Daniels  (1987)  comes  up  with  a saddlepoint 
approximation  based  on  Lugannani  and  Rice’s  (1980)  formula  to  the  tail  probability 
when 


X > E(  X ) i.e.  s > n E(  X ), 
i.e.,  the  saddlepoint,  f is  positive. 

The  tail  probability  is  then  given  by 


P[S  ^s]  = I - <!>  (i)  + i^it) 


1 _ 1 .O(n-^l^) 

2 $ 


(2.4) 


where 

i = [2n(t -X  - 
£ = [ 1 - exp(  -f ) ](  n k"  ) 

A ^ 

where  K is  the  cumulant  generating  function  evaluated  at  T. 


Let  Y = -X 


then  if 
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S = Y>Xj  , S'  = 5^r..  note:  S'  = -S. 

If  K(T)  is  the  moment  generating  function  of  Y,  then  K(-T)  is  the  moment 
generating  function  of  X,  i.e., 

Ky(T)  = K^i-T)  - K'y(T)  = -K'x(  -T) 

- Kyi  T)  = K'^i-T ). 

Let  Ty  ( > 0 ) be  the  solution  of  Ky{  Ty)  = s'  and 

Jy  ( < 0 ) be  the  solution  of  K^( 

KyiT)  = s'  - -K'j^i  -T)  = -s 

- 4(-n  = s 

- Ty=-Tx 

and  hence  we  have, 

ky  = Kyify)  = K^i-Jy)  = K T ^ ^ 

ky  = kyify)  = -k^i-fy)  = ~ k^i  T ^ (2-5) 

k'y  = k'yify)  = k'^i-fy)  = ^ ) = k 

From  equations  (2.4)  and  (2.5),  we  get  the  saddlepoint  approximation  based  on 
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Lugannani  and  Rice’s  formula  (1980)  to  approximate  the  tail  probability  when 

X < E(X),  i.e.,  S < n ■ E(X)  . 

I.e.,  the  saddlepoint,  t is  negative. 

It  is  given  by 


I - ^ U) 


<|)  il  ) 


- + O ( 

I. 


(2.6) 


where 


i =[2n(f^-X-k) 


z = [l  - cxp(f)]ink")^'^ 

A A _A 

where  P,  the  saddlepoint  is  negative  and  K is  evaluated  at  T. 

Equations  2.4  (right  tail  probability)  and  2.6  (left  tail  probability)  can  be 
combined  to  give  a tail  probability  approximation.  It  is  given  as  follows: 


P[Tail]  = \ ^ (V)  ^ 


— + O ( 


(2.7) 


where 

= [2n(  t ■ X - k) 
z'  = [l  - exp(  -Ifl  )]  - (nk")^^  ■ 
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Since  our  results  in  this  dissertation  are  associated  with  U-statistics  (centered 
about  zero)  and  we,  in  general,  have  worked  with  standardized  U-statistics,  we  felt 
the  need  to  have  a saddlepoint  approximation  for  a tail  (right  or  left)  probability  of 
the  standardized  U-statistics.  Let  U^  denote  a U-statistic  with  zero  mean  and  its 
standard  deviation  then 

Os  U,  = 


is  the  standardized  U-statistic.  We  observed  that  of  the  two  quantities  used  in  this 
approximation,  ^ was  scale  invariant  while  z was  not.  By  using  an  appropriate  scale 
change  transformation  on  z,  we  further  noted  that  the  tail  probability  approximation 
based  on  Lugannani  and  Rice’s  (1980)  approximation  for  n = 1 of  a standardized  U- 
statistic  is  given  by: 


P [ Tail  ] - 1 - <I>  ( ^ ) 


^ 4)  (5  ) 


(2.8) 


where 


5 = 2(T-U^^-K) 


1/2 


Z = 


1 - exp 


I .m 


• • ( r' ) 


" \ 1/2 


-AT 


Here,  ^ is  the  saddlepoint  which  is  the  solution  of  the  saddlepoint  equation 
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T)  is  the  C.G.F.  of  the  standardized  U-statistic  and  tail  means  right  (left)  tail 

if  >0  ). 

In  the  third  and  fourth  chapter,  we  will  look  at  the  problem  of  approximating 
the  cumulant  generating  function  of  a standardized  U-statistic  for  the  general  one 
sample  (based  on  a kernel  degree  K)  and  two  sample  case  (based  on  a kernel  of 
degree  Kj  and  K2)  by  using  Hoeffding’s  decomposition  (1961)  of  U-statistics. 


CHAPTER  3 

ONE  SAMPLE:  U-STATISTIC 


In  this  chapter  we  give  a brief  overview  of  existing  results  together  with  a 
formal  derivation  of  an  approximation  for  the  cumulant  generating  function  of  a one 
sample  U-statistic  based  on  a kernel  of  degree  K.  This  approximation  is  based  on 
the  well  known  Hoeffding’s  decomposition  (1961)  of  a U-statistic. 

We  start  in  Section  3.1  by  developing  the  notation  to  be  used  in  this 
manuscript.  We  also  discuss  some  known  results  using  this  new  notation  together 
with  their  proofs.  In  some  cases,  we  have  given  alternate  methods  of  proving  these 
results. 

In  Section  3.2  we  prove  two  lemmas  which  allow  us  to  use  the  martingale 
structure  of  a U-statistic  to  prove  that  except  for  a few  terms  in  the  expansion  based 
on  the  Hoeffding’s  decomposition,  everything  else  can  be  considered  as  a high  order 
term.  These  lemmas  are  actually  an  integral  part  of  this  dissertation  and  have  been 
used  often  in  approximations  for  cumulant  generating  function  of  both  one  sample 
degree  K and  two  sample  degree  Kj  and  K2,  U-statistics.  In  addition  to  these 
lemmas,  we  prove  the  approximation  for  cumulative  generating  function  of  the  one 
sample  U-statistics  of  degree  K. 

Finally,  we  discuss  an  example  (in  Section  3.3)  to  which  we  apply  this 
approximation  for  cumulant  generating  function  and  use  it  to  approximate  the  tail 
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probability  by  using  the  saddlepoint  methodology  and  comparing  these  results. 

3.1  Notation  and  Results 

Let  Xj,  X2,  ...,  Xn  be  independent  and  identically  distributed  (i.i.d.)  random 
variables  with  a common  distribution  F.  Let  h be  measurable  and  symmetric  in  its 
K arguments,  i.e., 

h ( X,  X,)  = h(X^^  X.^  ). 

For  N ^ K,  a U-statistic  of  degree  K is  defined  as 

= 

\ i i^  < ...  <i^  i N 

Note  that  in  the  above  definition  we  do  not  follow  the  usual  convention  of  dividing 

the  sum  in  3.1  by  ( ) of  its  terms.  Since  our  results  concern  the  standardized 

version  of  a U-statistic,  this  does  not  make  any  difference. 

We  assume  throughout  that 

E h(Xi,  ...,  XO  = 0,  E h'(Xi,  ...,  XO  < 00. 

Let 

gi  (xO  = E(h(Xi,  ...,  XO  |X,  = xO 

g2  (Xi,  X2)  = E(h(X„  ...,  X0|X,  = Xi,  X2,  = X2) 


gk  (xi,  ...  Xk)  = E (h(Xi,  ...,  Xk)  |Xi  = Xi,  ...,  Xk  = Xk) 
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T|;i  (x^)  = E (hi  X„  X,  ) I Xj  = ( j:,  ) 


\]f2  ( ) = ^2  ( ^1’  ^2  ) ■ 


E 

(2,J) 


:j=l. 


where  the  sum  is  over  all  the  number  of  distinct  combinations  of  size  1 out  of  2 X’s. 


2 


i|;3  ( Xj,  x^,  x^)  = g^(  x^,  x^,  Xj  ) - 


E E 

M (3.;) 


= 8k  i - ^k  ) - 


*-l 


E E 

y=i  (*.;■) 


X: 


Also  if 

° (AT-;)  (W-/-l)...(W-*-l)  = 0(ff‘-n 


then  we  further  define 


N 

f/j  = (iV-1)  (AT-2)...(A^-it+l)  Y, 

i=l 


♦i  ».)  = E 


1=1 


4^1  (Xi) 


f/j  = (iV-2)  (iV-3) . . . (N-k+l) 


1 ^ 


E E 

jj  < I2  ^ iV 


t2  ( ^1,  ) 
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- ^N-l,k-2  1 ^ N ’*'2  ^ \ ^ 


‘1  ‘2 


U,  - (N-3)  (JV-4)  . . . (N-*.l)  1 , f , at  »3  ( X,,  h ) 


''*  = 1 . f < /,  < .:  ^ ^ • *.. ) 


1 < ij ...  < f'i  W ^ *'|’  ■■■’  \ * 


where  r =1 

'•Ar.o  ^ • 

Now  under  these  definitions  for  the  general  one  sample  U-statistics  we  try  to 
address  some  preexisting  results  which  can  be  expressed  under  slightly  different 
forms  in  the  literature.  This  was  done  for  the  sake  of  preserving  the  continuity  and 
readability  of  the  manuscript. 

Let  us  start  with  Hoeffding’s  decomposition  for  the  one  sample  U-statistics 
with  a kernel  of  degree  k in  the  following  lemma. 


Lemma  3.1: 


;=i 


{k-j)\ 


Proof: 

5:  t,  ( Xp  X ) 
(*.;•) 


...,  X,)=g,{ 


t-i 

)-E 


= g,(Xj,  E 

;=i 


E ••  • E 

1 ij  -<  ...  < i.  k 


( ^1. 


% ( \ ) = St  ( 


k-l 


E 


E • 

1 Cj 


E 

ri  k 


St  ( \ ) 


= E 


^ . c,' < ..: \ ^ ) 


Also,  we  know  that 
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and  define 


^jN 


Then,  we  have 


N 1 ^ ij  K 


<i^iN 


fE  • ■ 

4^  1 C,  -<  ...  -< 


y=i 


c,  ‘ ■■■■  \ ) 


k 

- E 


izJ 

M (N 

[j . 


\ 


A rigorous  proof  of  the  above  result  is  given  in  Serfling  (1980),  pages  178-179, 
lemma  A (Hoeffding). 

Note  that. 


E • 

1 ^ 


■<  i,  ^ N 


V;  ( X,,. 


. N-M  ) E ■ ■ • E a,  f y 

( N-y  ) . . . ( iV-A:+l  ) 1 ^ ij  ^ ^ ^ 'V 


(N-j)  . . . ( AT-t+1  ) 
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N(N-1)  ..  . ^ 

N (N-l)  . . . (N-k+1)  ^ 


Ni 


Ni 


Uj, 


where 


= 


N ! 

( N-j  ) ! 


and  hence, 


k 


;=i 


(k\ 

Ni 


J. 


% 


= E — ^ 

M (k-j)  ! 


This  also  appears  in  A.  J.  Lee’s  (1990)  book,  as  a theorem,  even  though  the 
notation  is  not  the  same  as  used  here.  Now  we  use  the  principle  of  mathematical 
induction  to  prove  a lemma,  which  can  be  used  to  show  that  tp^'s  are  pair-wise 
uncorrelated.  The  following  lemma  also  appears  as  a theorem  with  different 
notation  in  Lee’s  book,  where  an  alternative  proof  is  given. 

Lemma  3.2:  For  any  k (a  positive  integer)  expectations  of  the  following  type 
are  all  equal  to  zero. 

£ [ i|/(  Xj,  Xj,  . . . , X,  ) ] = 0 
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£[i|/(Xj,  ^2,  )|XJ  = 0 


(3.3) 


£ [ t(  Xj,  X2,  . . . , X,  ) I Xj X,_i  ] = 0 

Proof:  Check  for  k=  1, 

£ [ ti(  X,  ) ] = £ [ /«(  Xj,  . . . , X,  ) ] = 0 . 
Now,  check  for  k=2, 


£ It2  (^1^2)]  = ^ 


g2(Xj,X2) 


:;=1 


= £[/»(  X,,  X,)  ] - £ TlTj  (Xj)  - £ (X2) 

= 0.  (3.4) 

£ I t2  (^P  ^2)  1^1  ] 

= g,  (X,)  - £ [ ilTi  (X2)  |X,  ) - £ [ T|;i  (X,)  I Xj)  ] 

= g,(  Xj  ) - g,(  X,  ) = 0 . (3.5) 

Thus,  from  equations  3.4  and  3.5  we  can  say  that  (3.3)  holds  for  k=2. 

Assume  that  (3.3)  is  true  for  all  k < n.,  i.e., 

£ [ t(  X,,  X2,  . . . , X^  ) ] = 0 


£ [ t(  Xj,  X2,  . . . , XJ  I XJ 


0 


(3.6) 
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E[^{X^,X^,  . . . ,X,)\X„  . . . , X,.,  ] = 0 
Now,  let  us  check  for  k=n+l. 

E [ I-..,  ( X, ] = 0. 

E [ (X„  ....  ) I X,.,  ] 

= «i  (X...)  - E E £ [ ( X...  X,,  ) I Kx 

7=1  B+1,/ 


= S.  (X,„)  - I £ [ K,  ( x,„  ) I X,„  ] + 0 1 

(t 

from  the  assumption  (3.6)  and  the  fact  that  X/s 

= = 0. 

E [ il/n+i  (Xj,  ...,  X^^^  ) I X^^j  , X^,  ...,  X.  ] ; 1 ■<  i -< 


S{n*2-i)^  ^i>  ■^1+1  » •"»  -^B+1  ) 


- E E El%(X  ...  X ) I X„,„  ...,  X,  ] 

y-1  («+i.;)  ^ 


^(n+2-i)  •"»  ^«+l  ) 


are  all  i.i.d. 


«+l. 
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- E [ I ] 

+ 0 = 0 . 
ft 

from  (3.6),  all  the  previous  terms  being  shown  as  equal  to  0 and  the 
fact  that  Xj’s  are  all  i.i.d. 

Hence  (3.3)  holds  for  all  k (positive  integers). 

As  a consequence  of  the  previous  lemma,  it  is  obvious  that 

1>,  ( Xj, X,, ) and  f,-  ( X,, X^_, ) for  \ i i * i'  i k. 


(3.7) 


are  pairwise  uncorrelated  and  let 


_ ..  {N-i\ 

' [k-j}’ 


then  we  have 


ol  = Var  (U^)  = 


* Var  ( Uj  ) 

h[(k-j)\]^ 


[ (N-l)  (N-2)  ...  (N-k+l)  f 


N E gli  X,  ) 


[ (N-2)  (N-3) 


[ ( k-2)  ! ] 


(N-k+l)  fN\ 

[2 


E V2(  ) 
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N \ r.  .2 


,k-l^ 


E ft-l  ( ^2’-’  ) 


'AT' 


E i|r^  ( X„  X^,  ...,  X*  ) 


= 


^Li.t-1  ‘N‘E  gli  X,  ) 


+ X 


N-2.K-2 


'N' 

a) 


E Xj,  X2  ) 


■^  ■ ■ ■ ^ 4-t.i,i  ( E 4,;.,(  x„  X, x,_, ) 


+ X 


2 fiV' 

Af-*.0  , 


E ( Xj,  X2,  ...,  Xj  ) 


(3.8) 


[ V Xi's  are  all  i.Ld.  ]. 


If  it  is  assumed  that  = E (X^)  > 0,  then  first  term  dominates  the  right- 

hand  side  of  equation  3.8  and  oj  is  asymptotically  normal  as  N -*■  00  (c.f. 
Hoeffding,  1948,  where  U-statistics  were  introduced).  Also  note  that 

oj  = O ( ) , 
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i.e., 


o„  = O ( ) 


(3.9) 

['.■  Xj’s  are  all  i.i.d.] 


3.2  U-Statistic:  Degree  K 

We  now  prove  two  lemmas  which  are  needed  to  derive  the  approximation  for 
the  cumulant  generating  function  of  both  one  and  two  sample  U-statistics.  We  use 
Hoeffding’s  decomposition  in  both  these  derivations. 

Lemma  3.3  : 

Assume  E \ h ( X^,  | < oo  for  some  5 > 0, 


and  further  define  5.  = • * * * X) 


then 


E ( I Sjj^  P"*  ) ^ 0(  ) 


(3.10) 


Proof:  Let  us  consider 


= 5^-5: 


'•'2* -*'1  ■ * 


' E E 1'2(  ) 

12=2  i,  = l 
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N 


<2=2 


Where  Sj  q = 0 and 


^ I ■^1,0 


’ ^1.<2-2 


) = 0 


L = 2,3,4...,  N. 


By  lemma  3.2. 


Thus  if  82^1  = 0,  then 


*2 


<2=2 


k^  = 2,...,N. 


has  a martingale  structure  and 


are  its  summands.  Using  a result  of 


Dharmadhikari,  Fabian  and  Jogdeo  (1968)  on  bounds  on  the  moments  of  martingales 
for  k2  = N and  C2  = constant,  we  get 


max 


(3.11) 


but  for  fixed  i2  > 2 
it 

*u=  E ViCXi,  .\)  ; 1 ij-i. 

.,.1 


is  also  a martingale  and  by  applying  Dharmadhikari,  Fabian  and  Jogdeo’s  (1968) 
result,  again  we  have 
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I ^1,4-1  P'®) 


max 

lii,sij 


B I i|.j(  X,^  , X^  ) I 


= C,  • (ij-l)'**'’  • B |4r(  X,  , Xj  ) |2-‘ 

= C,  • (ij-l)‘**'"  • B I gj(  X,.  Xj  ) - 

- «,(  X,  ) - «,(  Xj  ) i^-* 

£ C,'  -(ij-l)'**'"  • [ E I gj(  X,,  X,  ) |2*»  4- 

* B I g,(  X,  ) + B I g,(  Xj 

from  Q inequality  (C.  R.  Rao,  1973). 

^ Cf  • . £ I Xj,  ...,  X,) 

from  theorem  9.1.4  (K.  L.  Chung,  1968), 

and  thus  from  equations  3.11,  3.12  and  ^ I ^i»  ^2»  ) I °® 


E I ^2^  1^"*  ^ 0(  ) 

Again  consider. 


= S ij/  ( X 


N '3-1  «2-l 


= E E E t3( 

<,=3  <2=2  i,=l 


) ] 

(3.12) 

we  get, 

(3.13) 
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N 


= E 

1,-3 


-1 


Note  that  we  already  know  that  ; ij  = 3,  4,  ...  , N.  has  a martingale 


structure  and  furthermore, 


E ( 52.^,-!  I 5 


'2.1 


■^2.1, -2  ) 


= 0 


i,  = 3,4 N. 


by  lemma  3.2. 


*3 

\ij-l 

<3=3 


; k,  = 3 


> • • • > 


iV. 


also  has  a martingale  structure  and  Sj  j = S32  = 0.  Thus  by  again  applying 
Dharmadhikari,  Fabian  and  Jogdeo’s  (1968)  result  for  kj  = N we  have, 

E I p-  s C,  • (W-2)'-«  • E(  I p-  ) 


where  C3  is  a constant,  but  from  (3.13)  we  have, 

^(1  l'*‘ ) ^ o(  I i,-l  P'* ) 


max 

3^i3^iV  ^ ( I I 


2 + ft 


) ^ 0(  ) 


and  hence 


^ ( I ^3,N  1^"*  ) ^ ;y3(1.8/2)  ) 


(3.14) 
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Now  let  us  assume  that 


E ( I ^ 0(  ) 


holds  for  some  j > 2.  Now  consider, 


E 

l^ij< 


•E 

<i;SN 


♦/( 


X,,) 


(3.15) 


= E E 


E ’I'/f 


i,.i 


N 


= E s, 


Note  also  that  if  Sj.i  i = Sj  j 2 = . . . = Sj  ^ j j = 0,  then 


E ( I 5^.,  1 .....  ^^.2  ) = 0 V z.  = y.  7+I M 


by  lemma  3.2. 


Thus 


= E 

i,.i 


*;• 


= i . ;■+ 1, 


N. 


has  a martingale  structure.  Again  using  Dharmadhikari,  Fabian  and  Jogdeo’s  (1968) 
result  for  kj  = N we  have, 


E ( I S.^  ^ C,  -(N-y+l 


max 


E ( I 1^^^  ) 


31 


^ O ( ) from  (3.15). 

Therefore,  we  know  that  if  equation  3.10  holds  for  j-1,  then  it  also  holds  for 
j and  hence  by  principle  of  mathematical  induction,  equation  3.10  holds  for  any  j. 
Lemma  3.4  : 


Under  E \ h ( X^,  ...,  \ ^*  < <»  for  some  5 > 0, 


then  we  have 


E \ U . \ = O { N ) 


(3.16) 


Proof: 


[ £ I U,  I 2-'  ] 2-'  = [ £ I * ...  * U„  p-*  ] 


1 

2+fl 


5 [ £ I I j ^-*  *...*[£  I i/J  "•*  ] 


1 

2+8 


by  using  the  Minkowski  inequality. 

Let  A ( Xj,  . . . , ) be  the  kernel  of  degree  k for  an  appropriate  U-statistic  and 


let  us  consider. 


£ It/,  I ^-»  = £ I ( X,,  , ....  ) 


E ••••£ 


2 + 8 


^ ‘ ‘ ^ il;  tX  X t |(2+8) 


= O ( //<2*8)(*-;)  ) . £ 
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(2)(2  + 8) 

^ O ( ) ■ O (N  ^ ) 


from  lemma  3.3. 


= O ( ^ O ( N ]<2+«)  ) 

where  j = i+1,  i+2,...,  k. 
Note  that  our  main  assumption  in  the  derivations  of  lemmas  3.3  and  3.4  was 


E I h(X^,  ...,  X,  ) I < 


where  S > 0. 

We  will  now  use  the  above  two  lemmas  together  with  the  Hoeffding’s 
decomposition  to  derive  the  approximation  for  the  cumulant  generating  function  of 
a general  one  sample  standardized  U-statistic  based  on  a kernel  of  degree  K.  Here, 
we  assume  S G (0,1]  in  the  previous  two  lemmas.  Let  us  consider 


(f  ) = £ exp 


'N 


= E exp 


t Y - 


=E  exp 


t 

* '"*)] 

i (k-1)! 

' * 4 • 

(k-2)! 

' ij 

' 

^ E fi  a,) 

t 

\ 4^  ">  ] 

' 

exp 

exp 

% 

/ — s 
1 

*• 
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N 

JJ  E exp 


<=1 


'N 


+ E 


N 

n ®^p  ^ 


X 


^ ( UJ(k-2)\  + U,Kk-3)\  + C/_3  ) ■ 


+ E 


N / 

n ®^p  ^ 


Vi.t-1  ti  ( ^ 


/ iUJ(k-2)\  ^ U_^)^ 

t — 

m L 

2 


£ 

2 + « ' 

1, 

/ 

for  any  5 £ (0,1]- 


(3.17) 


where  1/  . = 


U, 


j‘t* 


1 ( )! 


Note  that  from  lemma  3.4  we  have 

E |t/..|  2 * * = O ( (A^*-0^iV2)  2 ^ . 
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for  S > 0. 

and  also  observe  that 

= 0{N^  ) . (3.18) 


n 

<=i 


Vi.i-1  ti  ( 


u 


'N 


and  the  interaction  term  within  the  square  term  in  equation  3.17  is  also  of  the  order 
0(  ) (3.19) 


Let 


Yjv  ( O 


E exp 


/ 


then  from  equations  3.18,  3.19  and  lemma  3.4  we  have, 

♦k  ( < ) = ( ( ) 


+ Yn  ( O 


'‘N-2,k-2  /N^ 
V 2 1 


/A. 


'N 


\ 


'N 


E llljC  ) lll^(  X2  ) t2(  ^2  ) 


Yn  ( O 


'"N-3JC-3 


'N 


(S') 


'N 


3 

/ 


• E i|fi(  X,  ) T|;j(  X,  ) tjf  X3  ) tjC  Xj,X2,X3  ) 
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X \ 


E t?(  X,  ) Hr,(  X^  ) ijf^C  X,PC^  ) 


N-2 


+ 


(t)  't^  ' 


• X 


2 

N-2,k-2 


(^)E^l(X,X,) 


, ^-3 

+ 6 • Yiv 


i.  t 

^N-2,k-2  fNs, 

T > 

2 • Ojy^ 


1 \ 2 
'^S-l.k-l 


'N 


• £ Tlf,(  X,  ) l|f,(  X2  ) y\f^(  Vi  ) t2(  ^2»^3  ) 


+ 6 


yN  ( o 


'■N-2,k-2  ^ 

;■  ^ 4 >> 


A.  \* 

'^N-l.k-X 


'N 


• [ £ t,(  X,  ) i|,j(  X2  ) Ili2(  Xj,X2  ) ] 2 


2 

+ 0 ( I N 2 f 1 ) 


(3.20) 


This  expansion  of  the  cumulant  generating  function  for  the  one  sample  U- 
statistics  with  a kernel  of  degree  k,  reduces  to  P.  J.  Bickel,  F.  Gotze  and  W.  R.  Van 
Zwet’s  (1986)  expansion  of  the  cumulant  generating  function  for  one  sample  degree 
two  case.  In  fact,  there  is  only  one  additional  term  in  this  expansion.  This 
additional  term  of  order  0(N'^)  is  important  for  U-statistics  with  kernel  of  degree 
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> 3,  i.e.,  if  order  is  equal  to  two,  this  term  dissappears. 

3.3.  Example 


We  used  the  Wilcoxon  Signed  Rank  test  statistic  as  an  example  to  test  the 
accuracy  of  the  cumulant  generating  function  approximation  as  defined  on  the 
previous  page.  Note  that  the  signed  rank  test  statistic  is  actually  a linear 
combination  of  two  U-statistics.  Since  it  is  possible  to  get  hold  of  the  exact  cumulant 
generating  function  of  the  signed  rank  statistic,  it  was  easy  to  carry  out  this 
comparison,  i.e.,  see  Figure  3.1.  Cumulant  generating  function  based  on  the  first 
four  (exact)  cumulants  has  also  been  included  in  the  comparison  as  we  wanted  to 
investigate  the  behavior  of  resulting  saddlepoint  approximation,  Easton  and 
Ronchetti  (1986),  based  on  it  also.  We  were  also  interested  if  eq.  (3.20)  could  be 
successfully  used  to  approximate  the  tail  probability  by  the  saddlepoint 
approximation  as  given  in  equation  2.8  (Chapter  2)  and  at  the  same  time  also 
provide  a viable  alternative  to  large  sample  (normal)  approximation. 

To  use  Hoeffding’s  decomposition  we  consider  the  Wilcoxon  Signed  Rank  test 
statistic.  Let  Xj,  ...,  be  i.i.d.  random  variables  with  a common  continuous 
distribution  function  F.  Let 


N 


^ I J ] 


(3.21) 


be  the  signed  rank  of  X;  and  also  let  denote  signed  rank  test  statistic  for 
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Wilcoxon  Signed  Rank  Test 
Using  Exact,  C.G.F.  (approx.)  and  C.G.F.  (K4) 
Comparison  of  Cumulant  Generating  Function 

Figure  3.1 
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testing  the  hypothesis  that  the  distribution  of  Xj  is  symmetric  about  zero,  thus 


~ ^ i 0 ] ' 


(3.22) 


Note  that  ^ \Jr  ( ) is  just  the  signed  rank  of  X^j)  where  X(j)  is  the  i***  order 

i=l 


statistic  in  the  sample  and 


T|r(X)  = 


1 if  X>  0 


0 otherwise. 


(3.23) 


Let  us  consider 


E • • • E f ( X.  + X.  ) 


N 


= N -U,(  X,,  ...  , X^  ) + " • U,(  Xj,  ...  , X^  ) 


where 


N 


V,  ( X. ^ E » ( X, ) 


39 


and 


U,  ( X,,  ...  , ) 


, Jl  E ■ ■ •E 

1 i < j ^ N 


t ( X.  + X . ) 


- E{W;)^NPiX^>0)*  [ P ( I X,  I < I Xj  I , > 0 ) 

• />  ( I XJ  < I X,  I . X,  > 0 ) ] 

QO 

= N { \ - F(0)  ) + N {N-\)  f P (-X  < X.<  x)  dF(X) 

0 


= N(  1 - F(0)  ) + N (N-1  ) f (F(x)  - F (-x)  ) dF(x) 

0 


Now  let  us  consider, 

- F jf; 


N 


= E ) 


E •••E 

I i i < j ^ N 


T|f  ( X.  + X . ) 


- N (1  - F m ) - N ( N-l)  f ( F (x)  - F (-X)  ) dF(x)  . ^3^4) 


Then  is  clearly  a U-statistic,  and  it  is  well  known  that  it  can  be  written  (by  using 
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Hoeffding’s  decomposition)  as  follows: 


- ( w-i)  E «,  ( X, ) - E E ( X,,  X, ) , 

1 = 1 i = l y=i  + l 


(3.25) 


where 


gj  ( X ) = ( 1 - F ( -X  ) ) - / ( 1 - F ( -X  ) JF  ( X ) 


N 


^ [ Ijo  „^(x)  - 1 + F (0)  ] 


and 


11T2  ( X,  y ) = - ( 1 - ^ (-J^)  ) - ( 1 - F (-y)  ) + 


f ( 1 - F(-x)  ) rfF(x) 


Under  this  setup,  B.  J.  Bickle,  F.  Gotze  and  W.  R.  van  Zwet  (1986)  defined 


( O = F exp  [ r 0;^^  (N-l)  g (Xj)  ], 


which  is  the  moment  generating  function  of  Of^  (N-\)  g (X,) 


E « ( X, ) “<>  = E E ♦( X, , X, ). 

i=l  ;=i+l 


i=l 
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where  Xj’s  are  all  independent  and  identically  distributed  random  variables. 

Then  they  derived  the  following  approximation  to  the  moment  generating 
function  of  Wilcoxon  Signed  Rank  test  statistic. 


yUt) 


+ 


N-2 


(O 


1 • (’  a;,’  w E S ( X,  ) g ( Xj ) iM  X, , Xj ) 


+ 4 • £ i|r'  ( Xj  , X2  ) 

4 


+ 


( r ) r^  o-^  X 


£ g ( Xj  ) g ( X2  ) I|r  ( Xj  , X3  ) iK  X2  , X3  ) 

+ 4 N’^[EgiX,)g(X,)^}f  (X,,X,)]^ 

O 


+ ( I Y//  ( O l^’'‘  P ( I t I ) + I IV  2 r 1 


(3.26) 


where  P is  a polynomial. 

Note  that  this  result  is  a special  case  of  our  approximation  to  the  moment 
generating  function  of  one  sample  U-statistics  based  on  a kernel  of  degree  k (i.e., 
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where  k = 2)  given  earlier  in  this  chapter  (equation  3.20). 

We  used  the  result  from  the  equation  3.20  (for  approximating  the  moment 
generating  function)  to  further  approximate  the  tail  probability  by  the  saddlepoint 
methodology  as  given  in  Chapter  2,  equation  (2.8).  To  do  this  we  first  had  to 
approximate  four  expectations  under  the  null  hypothesis.  Any  standard  numerical 
integration  technique  / package  could  have  been  used  for  this  approximation.  Since 
U-statistics  are  distribution  free,  these  expectations  are  independent  of  the 
underlying  distributions  as  long  as  the  distributions  are  symmetric  about  zero.  The 
approximated  values  of  these  expectations  were: 

(i)  E,  = Eg(X,)giX^)^(X,,X^)  = 0 . 

(ii)  E^  = E g\  X,  ) g(  Xj  ) i|i(  Xj  , Xj  ) = -0-00346951  , (3.27) 

(iii)  £3  = £ g(  Xj  ) gi  Xj  ) iK  Xi  , X3  ) t(  X2  , X3  ) = 0-00173457, 

(iv)  E^  = E t^(  Xj  , Xj  ) = 0-083333  . 

Note  also  that  except  for  these  expectations  everything  else  in  equation  3.20  (for  k 
= 2)  is  either  a constant  or  a function  of  t.  Thus,  when  we  use  eq.  3.20  to 
approximate  the  cumulant  generating  function  (of  signed  rank  test  statistic)  and 
henceforth  equation  2.2  in  Chapter  2 to  evaluate  the  saddlepoint  (which  we  need  for 
saddlepoint  approximation),  we  end  up  with  a function  of  t,  which  needs  to  be 
minimized  with  respect  to  t.  The  value  of  t for  which  this  function  is  minimized  is 
the  saddlepoint. 

Besides  the  normal  approximation  we  also  computed  two  other  saddlepoint 
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approximations. 

The  first  approximation  was  based  on  the  exact  cumulant  generating  function 
and  since  the  tail  probability  based  on  this  approximation  was  exactly  the  same  to 
that  of  the  null  distribution  of  Wilcoxon  Signed  Rank  test  statistic,  see  Table  3.1,  this 
tail  probability  approximation  was  excluded  from  Figure  3.2  to  keep  the  number  of 
comparisons  optimal. 

For  the  second  saddlepoint  approximation,  we  used  an  approach  of  G.  S. 
Easton  and  E.  Ronchetti  (1986)  where  they  proposed  to  approximate  the  first  four 
cumulants  of  the  statistic  and  in  turn,  used  these  to  further  approximate  the 
cumulant  generating  function,  used  in  the  saddlepoint  approximation.  Here,  mainly 
to  optimize  their  approximation,  we  used  exact  cumulants  (the  first  four)  to 
approximate  the  cumulant  generating  function  used  in  the  saddlepoint 
approximation.  Results  of  this  approximation  have  both  been  tabulated  in  Table  3.1 
and  plotted  in  Figure  3.2  (A-C). 

From  the  table  it  is  clear  that  the  saddlepoint  approximation  works  quite  well 
to  approximate  the  tail  probability  under  the  null  distribution  for  the  Wilcoxon 
Signed  Rank  test  statistic.  In  fact,  it  is  very  close  (up  to  the  fourth  decimal  place) 
to  the  null  distribution  (exact)  if  exact  cumulant  generating  function  is  used,  see 
Table  3.1. 

Saddlepoint  approximation  based  on  the  cumulant  generating  function 
approximated  by  the  Hoeffding’s  decomposition  also  works  quite  well.  It  not  only 
beats  the  normal  approximation  over  almost  the  whole  range,  but  it  also  beats  the 
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Table  3.1.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=  17)  between  Exact  (Null)  Distribution,  Normal  Approximation  and 
Saddlepoint  Approximations. 


w* 

«d 

Sp_app 

Exact 

Spex 

Sp_k4 

Normal 

35 

-1.964530 

0.0262973 

0.0253 

0.0252818 

0.0304044 

0.0247 

40 

-1.727840 

0.0449721 

0.0443 

0.0443508 

0.0470689 

0.0420 

45 

-1.491150 

0.0731930 

0.0727 

0.0727472 

0.0746681 

0.0680 

50 

-1.254460 

0.1127630 

0.1123 

0.1124000 

0.1137430 

0.1048 

55 

-1.017770 

0.1648830 

0.1645 

0.1645580 

0.1654000 

0.1544 

60 

-0.781079 

0.2296940 

0.2293 

0.2293990 

0.2298360 

0.2174 

65 

-0.544388 

0.3060150 

0.3056 

0.3057790 

0.3059420 

0.2931 

70 

-0.307698 

0.3913040 

0.3910 

0.3911510 

0.3911830 

0.3792 

75 

-0.071007 

0.4818850 

0.4816 

0.4818470 

0.4818480 

0.4717 

where 

= Wilcoxon  Signed  Rank  Test  Statistic 
= Standardized  Test-Statistic 

Sp_app  = Saddlepoint  Approximation  Based  on  Our  Approximation 

Exact  = Null  Distribution  of  Test-Statistic 

Sp_ex  = Saddlepoint  Approximation  Based  on  Exact  C.G.F. 

Sp_K4  = Saddlepoint  Approximation  Based  on  Exact  Cumulants  (Kj-K^) 

Normal  = Normal  Approximation 
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Figure  3.2  (A) 
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saddlepoint  approximation  as  proposed  by  Easton  and  Ronchetti  (1986)  in  JASA, 
where  the  cumulant  generating  function  has  been  approximated  by  using  the  first 
four  cumulants  (exact). 

For  an  extreme  test  statistic  value,  when  the  tail  probability  becomes  very 
small,  i.e.,  the  standardized  test  statistic  becomes  quite  large,  and  as  a result  the 
saddlepoint  also  relatively  becomes  large  and  then  the  saddlepoint  approximation 
(based  on  the  cumulant  generating  function  based  on  Hoeffding’s  decomposition) 
tends  to  lose  its  accuracy.  The  reason  for  this  is  that  for  a large  value  of  t,  the 
cumulant  generating  function  approximated  by  the  decomposition  theorem  may  not 
be  too  accurate  and  hence  the  saddlepoint  itself  may  be  less  accurate.  If  this 
becomes  a problem,  we  suggest  that  some  of  the  higher  order  terms  may  need  to  be 
considered.  It  may  be  worthwhile  to  note  that  this  is  not  a concern  for  saddlepoint 
approximation  based  on  exact  cumulant  generating  function,  which  beats  the  normal 
approximation  over  the  entire  range. 


CHAPTER  4 

TWO  SAMPLE:  U-STATISTICS 


In  this  chapter,  we  look  at  the  two  sample  U-statistics  and  formally  derive  the 
approximation  for  cumulant  generating  function  (similar  to  the  one  sample  case, 
discussed  in  Chapter  3)  based  on  the  Hoeffding’s  decomposition. 

We  start  by  giving  a detailed  proof  for  two  sample  U-statistics  based  on  a 
kernel  of  degree  (1,1)  in  Section  4.1.  This  will  not  only  allow  it  to  be  used  directly 
for  approximation  of  cumulant  generating  function  for  any  two  sample  U-statistics 
of  degree  (1,1)  such  as  the  Mann-Whitney-Wilcoxon  statistic,  two  sample  proportion 
problem,  etc.,  but  also  make  the  proof  for  the  general  case  easier  to  understand. 

In  Section  4.2  we  briefly  discuss  the  derivation  of  an  approximation  for  the 
cumulant  generating  function  of  a general  two  sample  U-statistic  based  on  a kernel 
of  degree  Kj  and  K2.  The  proof  discussed  in  this  section  is  an  extension  of  the  one 
sample  U-statistic  degree  K derived  in  the  previous  chapter.  We  also  apply  the 
lemmas  proved  in  Chapter  3. 

Finally,  we  discuss  an  example  (in  section  4.3)  to  which  we  apply  this 
approximation  for  cumulant  generating  function  and  use  it  to  approximate  the  tail 
probability  by  using  the  saddlepoint  methodology  and  compare  the  results. 
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4.1  Two  Sample:  Degree  (1.1) 

Let  Xi,  X2, X„  be  independently  and  identically  distributed  (i.i.d.)  random 
variables  with  distribution  function  F(X)  and  let  Yj,  Y2,  Y„  be  i.i.d.  random 
variables  with  distribution  function  G(Y).  If  h is  a measurable  function  of  two 
arguments 

h ( X^,  Yj  ) 

which  is  trivially  symmetric  within  its  X’s  and  Y’s,  then  a two  sample  U-statistics 
(Un,.n)  based  on  a kernel  h,  of  degree  (1,1)  is  defined  as  follows: 

t'..,  = E E * ( 

1-1  /-I 

Note  that  again  we  consider  a standardized  version  of  two  sample  U-statistics 
of  degree  (1,1),  where  we  assume  without  loss  of  generality  that 

Eh(X^,Y^)  = 0 and  £ ( Xj,  7^  ) < 00  . 

Notation  & Assumption: 

Let  us  define, 

E *2  ( ) ' <"  E £ ( * (*,-.  10  >1  *} ) 

" E Si  ( X, ) = » E £ ( * ( JT,,  J", ) I X, ) 

i=l  1=1 
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A„.  = E E 4r  ( Xp  Yj  ) where 

i=l  y=l 

^{x,y)  = hix,y)-g^{x)-g^{y) 

Such  that 

U = U + U + A . 

m,n  n m mn 

Since 

£ ( i|f(  Xj,  y,  ) I X,  ) = 0 and  £ ( i|f(  ^1,  1^1  ) I ) = 0 , 

the  random  variables  gi(  X;  ),  g2(  Yj  ) and  tp(  X;,Yj  ),  1 < i < m,  1 < j < n,  are 
pairwise  uncorrelated  and  hence 

<n  = = 0^(t/„„) 

= oUU„)  ^ oUUJ  . a^(A„„) 

= n E g^{  Fj  ) + n'^  m E g^^i  Xj  ) + n m £ \lf^(  Xj,  Fj  ) . 

If  it  is  assumed  that 

0^  2 = E gj2(  Xj  ) > 0 and  = £ g/(  F^  ) > 0 , 

then  (/  (U„,)  and  (Uq)  dominate  the  right  hand  side  of  (4.  ) and  also  ^ 

is  asymptotically  normal  as  n,  m oo  (c.f.  Hoeffding,  1948).  If  we  further  assume 
that  there  exists  a c > 0,  such  that  m = c n,  then  we  have 
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V,  = 0(n~^)  i.e.  = O ( n 


and  similar  to  lemma  3.4.  it  can  be  shown  that 


E I A. 


\2*6 


= O ( ) 


Let 


( O = £ exp  { r /n  ^2  ( ^ 


be  the  moment  generating  function  of  t a„  „ m g2  ( ) and 


Y„  ( O = £ exp  { r o'J„  n ( X^)} 


be  the  moment  generating  function  of  t n g^  ( ) . 


Now  let  us  consider, 


( O = £ exp  { r } 


E exp  it  o„‘„  U„  + t } exp  { r ) 


( N 


= E 


n ^ ^ °m.»  rf'  82  (Yj)]  n ^ ^ ^m.n 
V /=!  V «=1 


exp  ^ t ^m.n  E E t ( ^ 

i=l  y=l 


n (X.)  } 
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n \ / m \ 

n E expit  m g^iYj)}  H E exp  i t o'J„  ng^(X.)} 
j=l  y V i=l 


f " 


+ E 


n ^m]n  mg^iYj)) 

\ y=i 


n ^ ^ Om.n  n ( X.  ) } 
V '=1 


/ m n 

-1 


^ 0«.n  E E ) 

1=1  y=l 


+ £ 


/ « \ 

n exp  { r o~J„  m g^(Yj)} 

V y=i 


''  m ^ 

n ^ ^ ^m.n  « ^l(  ^ 

V «=i 


/ 

^ m n \2 

r -2 

o„,  — 

M /nffl 

\ ^ 

E E ♦ ( 

[ i=l  ;=1  ) 

+ O (E 


2*6 


) 


for  any  8 G (0,1]. 


y:  ( o • y:  ( o 


w~i  / ^ \ jif—i  / ^ \ 

+ Y„  ( O • Y„  ( O 


t o 


1 

m,n 


l'”' 

.ij 


X 


£ exp  { f ( m gj  ( ( Xj  ) ) } ij;  ( ) 
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(4.2) 


+ E 


( " 
n 

V /=! 


\ 

1 

1 

m 

i=\ 


-2  ‘ 
— o_  _ 

2 ’ 


5;  i|r^  ( X,,  ) + 5^  1|>(  X,,  y^ ) If  ( x,„  y,^ ) 


V 


i * /',  j 


* Y,  4>  ( X,.  y^ ) ♦ ( x„  yy ) 

« . j*  / 

+ 5;  4r  ( X,,  y.  ) ilr  ( X.;,  Y.,  ) 

«■  * «' . j * / / 


+ O [ E \ t o’* 

\ I w*,n  ;i 


|2  + 8 


wAcre  o^  ^ = O ( n ^ ) wAc«  /n  = cn  a/u/  E | = O ( n‘ 


Note: 


( exp  ( f o,;„  w ^2  ( y, ) ) - 1 - f o;,;„ 

( exp  ( t a'J„  n g^(X^))  - 1 - t 


exp 


t n (X^) 


m,n 


W gj  ( ) ) X 

e„!„  n gi  ( Xj  ) ) 


(4.3) 
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- E ( ' o.;,  " «,  ( X,  ) ) ' U2  ( >'i ) 


v=0 


+ t 0«.n  « 


/ ( tmg^  (Kj) 

exp  

a 

m,n 


- E ( ' ""I-  ">  «2  ] Si  ( ) 


v=0 


+ <^m.n  n m g^(X^)  g^iY^) 


+ I m 81  ( Xj  ) ^2  ( ) 


+ I <^m!n  n g,  ( Xj  ) g2  ( Xj  ) 


( Xj,  Y,  ) 


^ [ ( exp  ( ? o;,|„  ( m ^2  ( Xj  ) + n ( Xj  ) ) ) - exp  ( r 
- f ^m,n  « 5i(  -^1  ) exp  ( f m ^2(  Xj  ) ) - exp  ( 

+ 1 + ^ ^m,n  «5l(  ^y)  - t ^mln  ^ Xj  ) CXp  ( f O, 

+ t Om!„  m g^{  Xj  ) + m n g^{  Y^  ) gj(  Xj  ) ) 


/ f m ^2(  X.  ) / -1  1 1 

^ ^ « 5i(  Xj  ) ) - f 0„  „ 


m 


m,n 


e„I„  m gj  ( Xj  ) ) ) 

f V„  /I  gi(  Xj  ) ) 

i « Syi  X,  ) ) 

^2(  Xj  ) 
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- m n ) gjC  ^1  ) - T sli  -^1  ) gii  ) 

( t n gA  X.  ) , , V _i 

exp  t wi  gjC  ) - ^ Om.«  « 5i(  -^1  ) 


- w n g^(  Xj  ) ^2(  ^1  ) - I 


-3  : 

» w 

m,/l 


« gi(  ^1  ) g!(  Y,  ) 


+ « »»  ^i(  ) gi(  Yi  ) 


+ I "»  5?(  -^1  ) gi(  Yi  ) 


+ 1/2  n m^  g^(  Xj  ) gj(  Xj  ) 


ilf(  X,,  Xj  ) 


= Eexv(t  o„;„  ( m g2<  1^1  ) + « X,  ) ) ) t(  X„  Y,  ) 

- £ exp  ( f o;,|„  m g^i  Xj  ) ) i|j(  Xj,  Xj  ) 

- E exp[t  a'J„  n g^i  X^  ) ) t(  X^,  X^  ) + £ iK  Xj,  Xj  ) 

= £ exp  ( f o;,|„  ( m g2(  l^i  ) + « gi(  ^1  ) ) ) t(  Xj,  Xj  ) 

which  is  the  last  factor  in  second  term  in  equation  (4.2). 

Also  note  that  the  first  term  in  equation  (4.3)  is  a high  order  term,  i.e. 

^ ( exp  ( f o-J„  mg^iY^))  - 1 - a'J,,  m g^(  Y,  ) ) x 
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(exp  (t  a'J„  n gj(  Xj  ) )-  1 - f o'J„  n g^(  ) ) 


exp  ( ^ ( m gj(  Xj  ) + n gj(  Xj  ) ) ) - exp  ( f ( 


- ^ ” «i(  -^1  ) exp  ( f o;.|„  TO  gj(  7j  ) ) 


- exp  ( f « gj(  Xj  ) )+  1 + r a„;„  n g,(  X,  ) 


- f em!«  »»  gi(  ^1  ) exp  ( f a„;„  /i  g^(  X^  ) ) + f o„;„ 


TO 


o.!»  ”>  n S2<  I'l  ) «i(  *1  ) X -K  *1.  J’l  ) 


( 1 + ' <’»!«  ( <"  *2<  I"!  ) ■»  n Si(  X,  ) ) 


2 ^-2 


a 


( TO  g^  (Y,)  - n g,  (X,)  f 


+ 1/6  • ( TO  gjC  Xj  ) + n gj(  ) f ) 


t . -1  . V ^ ^2  ( 5^1  ) 0 

1 + f <^m.n  m g^C  Xj  ) + ^ L_  + 


-1 


- t ^m.n  « ^l(  -^1  ) 


f2  _-2  : 

-1  „ . ^ T,  X ^ ^m.H  ^ 


1 + t <^m,n  fn  82^  Yi  ) + 


t(  X„  Y,  ) 

^«ln  l"l  ) ) 

^2(  1^1  ) 


m.n  g2^  X^  ) 
6 

' ^2  ( 1^1  ) 
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, , -1  ,v^  SU  ^m.n  gh  ) 

1 + t V«  " «i(  ^1  ) 2 " ^ 


6 


+ 1+  t o'J.n  ng^iX^)  + t a'J„  m g^C  Y^  ) 


f^m.n  "»  « 82(  Yl  ) 8i(  ) 


- t <^m.n  m g^{  Ki  ) 


1 + t Om!„  n 5j(  Xj  ) + 


otn  8h  X,  ) 


\ 


/ 


+ o ( «-"  tM  1 • iM  X,.  y, ) 

where  we  assume  that  there  exists  a constant  c (greater  than  zero)  such  that  m = 
cn. 


= E 


2 "Z  c g^(  y, ) g,(  X, ) 
2 


3 otn  g 8,i  X,)g^jY,)[cn  ^ n g^(  X,  ) ) 

6 


^ 8,i  X,  ) g,(  Y,  ) 


^m.n  82  ( Xi)  8i(Xi) 

2 


- <„  c g,i  Y,  ) g,i  X,  ) 

+ Vn  C g^i  Xj  ) gj(  Xj  ) 
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'i'  c sU  X,  ) g,(  Y,  ) 


• iH  Xj,  Kj  ) -H  O ( n-^  ) 


= O (n-^  t*)  (4.4) 

Finally  also  observe  that  the  second  term  in  equation  (4.3)  is  also  neghgible. 
This  can  be  shown  as  follows. 


-1  ♦ -1 

f ^ « ^i(  ) ) 

2 t a„  . m 

m,n 

exp 

^ m,n  y 

j.  ( ^ <]n  n 8i(  Xi  ) )' 

v=0 


V ! 


g^(  T,  ) • t(  ^1,  Y,  ) 


^ 2 1/  o:‘.  I m E \ g^(X^) 


m,#i 


IM  Xj,  7j  ) ^2(  ) 


V e e [ 0,1  ] 


If 


0 = 6 e = — — and  m = n c for  some  constant  c,  then  we  have 
2 + 6 


4*6 

= 2 I / a~J„  n I c'  £ I g,(  X,  ) |2  - » | i|;(  X^,  Y^  ) g^(  Y^  ) 


V 0 6 [ 0,1  ] 
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.2 


E gti  X,)‘E\g^{Y,) 


2*6 

1+6 


1+6 

2+6 


[ £ I l|r(  X„  Y,  ) p-‘  ]"•“ 


By  Holder’s  Inequality. 


= O ( I n 2 f I ) 


(4.5) 


: a = O ( n ) when  m = c • n 


Similarly,  we  can  show  that 


2 t a'J„  n 

exp 

( tmg^iY,)^ 

*• 

\ m.n 

2 / 

t o’* 

m. 

n "»  gl(  ^1  ) )' 

v=0 

V ! 

/ 

1 

g,i  Xj  ) • ili(  X,,  y, ) 


^ 0[  \ n ^ t \ j 

Now  let  us  consider  the  third  and  final  term  in  equation  (4.2). 


/ « 


n ^ °m!i.  g^i  Y.  ) ) 

V y=i 


^ m 

n ®^P  ( ^ ^n.n  n gj(  X.  ) 
1=1 


(4.6) 
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i!  ^ 
2 


X,.  yp  + E iK  X,,  y^ ) iK  x,„  y. ) 


i * i\  j 


* Y;  4>  ( X,,  y,  ) >|r  ( X„  Yy  ) 


t .J*J 


* Y ♦(  ) 'K  ^i'.  1^' ) 


I -*  i'  , j * }' 


^yl~^  (O  ’ yT^  ( t)  ' o'J,,  m n E i|r2(  X^,  ) 


^yl'^  (O  ' yT^  ( O ‘(t  a~jy  2 


2)[l 


n 


£ «,(  X, ) *,(  Xj ) <<(  x„  y, ) iH  Xj,  K, ) 


♦ I y:'"  ( o • rr'  (<)•(<  o,;,  ><  2 


n \ I m 


UMi 


E *2(  y, ) «2<  yj ) if(  x„  y, ) ik  x„  y^ ) 


1 n-2  / . X m-2 


.-1  n6  _2  „2  ( 


• 2 Y«'  ( O ' Y„'  ( r ) • ( n'  ^2j  1 2 


[ £ s,(  X, ) gj(  y, ) i|/(  x„  y, ) ] 


o ( »-’  (‘  ) 
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where  m = n • c. 


2 


/l*"l  / ^ \ 

Y„  ( O 


W”!  / ^ \ 

Y„  ( O 


m n £ Xj,  7j  ) 


+ 1 yI’^  ( ^ ) y”'^  ( O «2  ^ ^ 


£ g,(  X^  ) gj(  X,  ) t(  Xj,  Xj  ) il;(  X^,  Xi  ) 


1 

+ — 
2 


y!T^  ( t ) y"'^  ( O n («-l)  m x 


£ ^2(  ) ^2(  ^2  ) ^2  ) 


1 w~2  / ^ \ / ^ \ 

2 Y„  ( O Y„  (Or  o„  „ 


m^  n^  n («-l)  m (m-1)  x 


+ o ( /tO'*  ) 

where  m = n c. 

Thus  from  equations  (4.2)  - (4.7),  we  can  say, 
4>^  ( O = Ym  ( O • Y^  ( O 


/I  1 / ^ \ l?l“l  / ^ \ 

+ Y«  ( O • Y„  ( O 


(4.7) 


X 
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^^-5-  E g,(  X, ) gj(  y, ) ♦(  X,,  y, ) 

n 

ntfH 


* I ”«!«  n^<nE  «?(  X,  ) «j(  y,  ) ♦(  x„  y, ) 
- 1 C » »■"  £*,(  X, ) &'(  y, ) ♦(  Xi,  y, ) 

* i £ K"(  Xi.  y, ) 

^ o 

m,n 


+ — 


1 n-1 


m-2 


Y„  * (O'  vr  ( O o;.;  m (m-1)  x 


E gj(  Xj  ) gj(  X2  ) 4r(  X„  7i  ) t(  X2,  Y,  ) 


1 '•“2  / , \ m-1  / -4  I ^ ,v 

* 2 ^ ' Y„  ( O n (/i-l)  x 


E g2(  Y,  ) g^(  Y,  ) iK  Xj,  ) 4r(  X,,  y^  ) 


#1-2 

m 


( O • Y 


m-2 

n 


(O 


^6  ,*,3  --3 

^ («-l)  (m-l)  X 


[ £ gi(  X,  ) g2(  I'l  ) ’!'(  J'l  ) ]' 


(4.8) 
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( 

1 ^ 

+ 0 

, 1 Yi'"  ( 0 • Yr'  ( ' ) 

• P ( 1 r 1 ) n-^  ® + 1 n M ^ 

where  P is  a polynomial. 


4.2  Two  Sample:  Degree  (K,,K^) 


Let  Xi,  X2, ...,  X„  be  independently  and  identically  distributed  (i.i.d.)  random 
variables  with  distribution  function  F(X)  and  let  Yj,  Y2,  ...,  Y„  be  i.i.d.  random 
variables  with  distribution  function  G(Y).  If  h is  a function  of  K,  + K2  arguments, 

* ( ■ • • ’ >'/. 

which  is  symmetric  in  its  Kj  X’s  and  K2  Y’s,  then  a generahzed  two  sample  U-statistic 
(Gni.n)  based  on  a symmetric  kernel  h of  degree  Ki  and  K2  is  defined  as  follows: 


If  X.  = (X. , ...,  X.^  ) and  X.  = (X.^.  ....  X.^^)  then 


U 


m,n 


i.  < m ; 1 w'.  < ...  < 4 i < 5 ’ 5-  > 


(4.9) 

Note  that  again  we  consider  a standardized  version  of  two  sample  U-statistics  of 
degree  Kj  and  K2.  We  can  assume  without  loss  of  generahty  that 


£ A ( X , X ) = 0 and 


E P i X , X ) < oc  . 
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where  x=  (Xj,  and  Y= 


Let  us  define 


gij  ( yp  yj  ) 


= E(h(  X.  Y ) 


Xj—  JCj,  ...,  Xj—  x-f  Xj— 


rr  yj ) 


where  i = 0,1,  ...,  Kj  & j = 0,1,  ...,  K2. 


Now  define, 

%,(Y,)  = E(h(X  , X ) I X,  ) =goi  ( y, ) 


ilTio  ( Xj  ) = £ ( A ( X . X ) I XJ  = ^10  ( Xj  ) 


( ^1.  Yi  ) = ^1.1  ( Yi  ) - to.i  (Yi)~  'iTi.o  ( ) 

= Y,  ) 


- E EE  t,.,/(X,,...,X  i;.  X.  ) 

J*M  lj:iy  ^ 


where  i is  used  as  an  index  for  X,  i'  is  used  as  an  index  for  Y and  the  double  sum 
is  over  all  the  number  of  distinct  combinations  of  size  j(=0  or  1)  out  of  one  X and 
also  over  all  the  number  of  distinct  combinations  of  size  j'(=0  or  1),  out  of  one  Y. 
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Similarly, 

Y„  Y,  ) = ( X,,  Y„  Y,  ) 


2 


j*j'=l 


EE 

IJ-.2J' 


= ( X,,  Y„  Y,  ) 


- [ ( X,,  y. ) + ( X,,  K, ) + ( r„  y, ) ] 


- [ to.,  ( y,  ) - 1>,,„  ( X,  ) ^ ( ^2  ) ] 


♦j , ( x„  x„  y, ) = ( X,,  Xj.  y, ) 


2 


;>/=! 


= g2 1 ( X,,  X^,  Y,  ) 


[ ^1*1,1  ( ^l»  ^1  ) 'J'l.l  ( -^2’  ^1  ) 'i^2,0  ( ^2  ) ] 

- [ ti.o  ( ) + ^1,0  ( ^2  ) ^ 'I'O.I  ( ^2  ) J 


Ys  ) = 8j  ^ 


Y,) 
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r+5-l 


where  i is  used  as  an  index  for  X,  i'  is  used  as  an  index  for  Y and  the  double  sum 
is  over  all  the  number  of  distinct  combinations  of  size  j(=0,l, r),  out  of  a total  of 
r = 0,1,  ...,  Ki,  X’s  and  also  over  all  the  number  of  distinct  combinations  of  size 
j'(=0,l,  ...,  s)  out  of  a total  of  s = 0,1,  ...,  K2,  Y’s. 


X \ y,.  yi, ) 


- E 


E EE  f/,/ ( 

+/=1 


Also  as  we  defined  in  Chapter  3,  let 

‘ V ” ~{ x',  * («-;■)  («-2)...(n-*+l)  = 

^ ^ { n-k)] 


then  we  further  define. 


^1,0  ■ ^n,ki  ^ ) 


i=l 
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(»i-l)...(m-A:i+l)  • n...{n-k^+\)  ( X,.  ) 

i=l 


= C„.*.  E to.i  ( ) 

y=i 


= m...{m-ky  + l)  • (n-l)...(n-^+l)  E toi  ( ) 

y=i 


^1,1  “ 


-l.Vl 


E E 

i=l  ;=1 


= (m-l)...(m-k^+l)  • (n-l)...(/i-^+l)E  E ^i.i(  Vj  ) 


i=l  y=l 


u = c c 


X 


1 .i, 

1 


< -i  m 
<j,  in 


E 'Hr,  ( X,, X, , ...,  ) 
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1 i A ■<  •••  -<  ^ n 


where  C„,o  = ^n,o  = ^ • 


Then,  note  that 


(4.10) 


Where  Uo,o  = 0. 


Proof  of  the  above  result,  Hoeffding’s  decomposition  in  the  two  sample  case 
is  similar  to  the  one  sample  proof  given  on  page  19  of  this  manuscript.  A.  J.  Lee 
(1990),  in  his  book  "U-Statistics",  Theorem  3,  page  40,  also  states  the  same  result  in 
a slightly  different  set  up. 

By  using  the  independence  of  Xj’s  and  Yj’s,  together  with  the  result  of  Lemma 
3.2.  in  Chapter  3,  it  implies  that 


(4.11) 


at  least  one  of  two  groups  of  subscripts  are  different,  are  uncorrelated. 


Thus, 
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a 


2 

m,n 


*1  <1 


f'‘”-  ( V... ) = E E 

r=0  j=0 


y“r  ( Ur.,  ) 

[ ( t,-r)  ! • ( *j-s)  ! ]" 


*1  ^ 


= E E 


m\  (m-r)!  1 n\  (n-^)! 

(w-ibj)!  m!  (itj-r)!  (n-k^\  n\ 


1 

2 

f/i) 

ihi-s)  ! 

UJ 

E Vv  (X.. 


k^  fcj 


= E E 

r=0  s=0 


(m-r)! 

(m-itj)!  (itj-r)! 


(n-.y)! 

(n-k^V  (ifej-.s)! 


2 


m 


V„\ 


X 


E t;,(  Xj, ...  x^,  y,, ...  y, ) 


k^  ij 


= E E 


■((m-r)' 

' (.n-s) ' 

(m) 

'n' 

.1 

'i'-J 

E ( X, x„  y,, ....  y, ) (4.12) 

■.'  Xj’s  are  all  i.i.d.,Yj’s  are  all  i.i.d.  and  furthermore,  there  is  independence  between 
X’s  and  Y’s. 


By  choosing  m = c-n  ( for  some  c > 0 ),  it  is  quite  obvious  thato^  ( ) 


and  dominate  the  right-hand  side  and  also  note, 

U»1 
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ol.  = O ( >■'  ) 


(4.13) 


Now  based  on  An  j u j (See  page  26,  Chapter  3)  if  we  define 


ft  =1  • 1 

^m-r.n-5  ^m-r,ifej-r  ^n-s,k2-s 


(4.14) 


and  then  consider, 


-1 


4>„,„  ( O = £ exp  [ r ] 


= .E  exp 


t a 


-1 


*1  *2  TJ 

E E ^ 


r=0  5=0  (^r^)!  (^->5)5 


= E 


exp 


t a 


-1 

m,n 


(kr^V-  k,\ 


exp 


t a 


1 

m,n 


^0,1 


k,!  (k^-l)l 


exp 


tu_^  1 

o 

V «.'>  y 

where 


r+s  > i 


(ki~r)\  (k^-s)\ 


= E 


' m \ 

n exp  [ f vjrjo  ( X.  ) ] 

i=l 
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/ 


\ 


n 


n exp  [ f 

y=i 


to.i  ( ) ] 


X 


1 + t o 


-1 

m,n 


u, 


1,1 


^2,0 ^ ^ 


[ (itj-l  )!  )!  (k,-2)\k^\  k,\(k^-2)\ 


^ ^ ^3,0  ^ 

( k^-2  )!  ( *2-1  )5  ( ^1-1  V-  ( ^-2  )!  ( k^-3  )!  k^\ 


(k,-3)\ 


U 


-3 


1/2 


fo 


-1 

m,A 


u, 


1.1 


a 


2,0 


U, 


oa 


( *1-1  )!  ( *2-1  )!  (*i-2)!*2!  *j!(*2-2)! 


\ 

2 

- U-2 

»■ 

/ 

+ o(e  \t  o'J„  t/.i  I ) for  some  6 > 0 (4.15) 

Choosing  m = c n (for  some  c > 0)  and  by  using  lemma  3.4.  in  Chapter  3 
we  know  that 

E I I = O ( n (2.6)  I (4  16) 

where  6^0. 


and  also, 
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' m \ 

n ^ ^m\n  '^1.0  ( ) 1 

i=l 


(4.17) 


n \ 

n ®^P  I ^ 4^0, 1 (Yj)] 

y=i  ) 

= o 

Noting  that  interaction  terms  within  the  square  term  in  the  equation  (4.15) 
are  also  of  order  0(  n ),  thus  we  have 

4>^  ( O = y:(  O • y:(  O 


m-l/  . \ n-1 


- Yr‘(  f ) yr(  t ) (t  I'j* 


E X,  ) i|ro,(  Y,  ) ) 


1 / ^ \ /i'“l  / ^ \ “1  \4 

+ 2 ( O Ym  (t)(t  a„„y 


'n' 

5 

.ij 

m- 

'm' 

(n) 

.1> 

^ <o(  ) ^o.i(  >^1  ) 'I'uC  ) " 

e t,.»(  X. ) <.(  I'l ) l>u(  Jti.  r, ) ] 


1 W"!  / - \ if""l  / ^ \ ” 1 \' 

+ 2 ^ ) V«  ( O „) 


m.n/  m n 


£ ,l;ti(  X„Y,  ) 
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+ O ( I n 2 r I ) (4.18) 

The  preceeding  approximation  (4.18)  of  cumulant  generating  function  for  the 
general  two  sample  U-statistic  is  based  on  a kernel  of  degree  Kj  and  K2.  Although 
this  approximation  involves  a total  of  21  expectations,  some  of  these  are  the  same 
because  of  their  symmetric  nature.  It  should  be  noted  that  the  approximation  (4.8) 
for  cumulant  generating  function  of  a two  sample  U-statistic  based  on  a kernel  of 
degree  (1,1)  is  a special  case  of  equation  4.18  and  it  involves  only  six  expectations, 
some  of  which  may  also  be  similar. 

4.3  Example 

For  a two  sample  U-statistics,  we  used  Mann-Whitney-Wilcoxon  Rank  Sum 
test  statistic  as  an  example  to  test  the  accuracy  of  the  above  approximation.  Due  to 
the  non-existence  of  a closed-form  expression  for  the  cumulant  generating  function 
of  the  Mann-Whitney-WUcoxon  test  statistic,  we  had  to  approximate  the  cumulant 
generating  function.  We  used  (after  duplication)  the  null  distribution  of  Mann- 
Whitney-Wilcoxon  statistic  (for  m = 10  and  n=5)  as  published  by  Buckle,  Kraft  and 
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van  Eeden:  "Tables  Prolongeese  de  la  Distribution  de  Mann-Whitney-Wilcoxon" 
(1969b)  and  also  adapted  in  Lehman’s  (1975)  book  to  approximate  the  moment 
generating  function  and  hence  the  cumulant  generating  function.  This  was  used  to 
compare  not  only  with  our  approximation  of  the  cumulant  generating  function  but 
also  the  approximation  of  the  cumulant  generating  function  which  was  calculated  by 
the  first  four  exact  cumulants  of  the  Mann-Whitney-Wilcoxon  test  statistics  as 
provided  by  Sundrum  (1954),  see  Figure  4.1. 

Let  Xi,  X2, ...,  X„,  be  independently  and  identically  distributed  (i.i.d.)  random 
variables  with  distribution  function  F(X)  and  let  Yj,  Y2,  ...,  Y„  be  i.i.d.  random 
variables  with  distribution  function  G(Y)  = F(Y-A),  where  -00  < A < 00  is  an 
unknown  shift  parameter.  The  null  hypothesis  of  interest  for  this  setting  is  Ho:A=0, 
and  the  alternative  hypothesis  can  be  any  one  of  the  three  alternatives,  Hj:  A >,<, 
or  ^ 0.  To  test  the  above  hypothesis  the  Mann-Whitney-Wilcoxon  test  statistic  as 
proposed  by  Mann  and  Whitney  (1947)  is 

= E E 4>(  ) , 

i=i 


where 


4>(X) 


1 if  X>  0 

0 otherwise. 


Let  E = [ 0(  Yj  - Xj  ) ] = p,  then  for  deriving  our  approximation  for  cumulant 
generating  function  of  Mann-Whitney-Wilcoxon  test  statistic  the  corresponding 
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Figure  4.1 
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U-statistics  which  is  centered  about  zero  can  be  defined  by 

‘t  EH  I", ) 

1.1  /.I 

where  h{Xp  Yj)  = <|)(  ) - /?  . Note  that  again  we  do  not  follow  the 

usual  convention  of  dividing  the  sum  in  the  above  formulation  by  the  number  m n 
of  its  terms.  Since  our  results  concern  the  standardized  version  of  this  does  not 
make  any  difference.  To  apply  the  Hoeffding’s  decomposition  we  defined 

g,ix)  = Eh{(  X.,  Y.  ) I X.=x  ) 

= \ - G{x)  - p , 

g^{y)  = Eh[{  X„  Y.  ) I Y.  = y ) 

= F(  y ) - p and 

y)  = /i(  X,  y ) - AT  ) - y ) 

= P{y)  - P ' 

Under  this  setup  we  used  the  result  from  the  approximation  (for  moment 
generating  function)  as  given  in  equation  4.8  to  aproximate  the  tail  probability  by  the 
saddlepoint  approximation  as  given  in  Chapter  2,  equation  (2.8).  To  do  this  we  first 
had  to  approximate  six  expectations  (of  which  some  were  same)  under  the 
assumption  that  the  two  underlying  population  distributions  are  identical,  i.e.,  null 
hypothesis.  Again  since  U-statistics  have  a distribution  free  property,  these 
expectations  are  independent  of  the  nature  of  the  underlying  distribution.  These 
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expectations  were  found  to  be  as  follows: 

(i)  £,  = £ s,(  X,  ) «2(  K,  ) >|r(  X,  , y,  ) = 0 , 

(ii)  £j  = E g?(  X,  ) Sj(  yj  ) IK  X,  , y,  ) = -0-027778  , 

(iii)  £,  = E s,(  X,  ) s’(  X;  ) iM  X,  , y,  ) = -0-027778  , 

(iv)  E^  = E i|f2(  Xj  , y,  ) = 0-083333  , 

(v)  E,  = E Xj  ) Xj  = 0-013889  , 

(vi)  E,  = E 8^(  y,  ) g^(  Y^  ) iK  Xj  , yj  ) i|/(  X,  , y^  ) = 0-013889  • 

Note  that  in  equation  4.8,  except  for  these  expectations,  everything  else  is  either  a 
constant  or  a function  of  t.  Thus  when  we  use  eq.  4.8  to  approximate  the  cumulant 
generating  function  (of  Mann-Whitney  test  statistics)  and  henceforth  use  equation 
2.2  in  Chapter  2,  we  come  up  with  an  equation  in  t which  we  would  like  to  minimize 
with  respect  to  t.  The  value  of  t for  which  this  equation  is  minimized  is  the 
saddlepoint,  needed  for  the  saddlepoint  approximation. 

Besides  the  normal  approximation,  we  again  computed  two  other  saddlepoint 
approximations. 

The  first  was  based  on  the  cumulant  generating  function  as  evaluated  by  the 
null  distribution  of  the  Mann-Whitney-Wilcoxon  statistic  and,  since  the  tail 
probability  based  on  the  saddlepoint  approximation  (eq.  2.8)  was  very  close  to  that 
of  the  null  distribution  of  Mann-Whitney-Wilcoxon  test  statistic  (see  Table  4.1),  this 
tail  probability  approximation  was  again  excluded  from  Figure  4.2  to  keep  the 


82 


Table  4. 1.  Comparison  of  Tail  probability  approximation  of  Mann- Whitney-Wilcoxon 
Test  Statistic  (n=5,m=10)  between  Exact(Null)  distribution,  Normal  approximation 
and  Saddlepoint  approximations. 


w 

W„a 

Sp_app 

Exact 

Sp_ex 

Sp_K, 

Normal 

1 

-2.939390 

.0007 

0.000716267 

0.00164431 

2 

-2.816910 

.0013 

0.001357470 

0.00242438 

3 

-2.694440 

.0023 

0.002408450 

0.00352536 

4 

-2.571960 

.0040 

0.004014450 

0.00505617 

5 

-2.449490 

.0063 

0.006359880 

0.00715294 

6 

-2.327020 

.0097 

0.009662700 

0.00998223 

7 

-2.204540 

0.0159425 

.0140 

0.014171300 

0.01374320 

8 

-2.082070 

0.0209580 

.0200 

0.201600000 

0.01866820 

9 

-1.959590 

0.0283532 

.0276 

0.027921300 

0.02502180 

10 

-1.837120 

0.0379989 

.0376 

0.037757500 

0.03309630 

11 

-1.714640 

0.0501082 

.0496 

0.049968600 

0.04320540 

12 

-1.592170 

0.0649349 

.0646 

0.064839400 

0.0735473 

0.05567734 

13 

-1.469690 

0.0827180 

.0823 

0.082625000 

0.0884643 

0.07082230 

14 

-1.347220 

0.1036560 

.1032 

0.103536000 

0.1078120 

0.08895480 

15 

-1.224740 

0.1278910 

.1272 

0.127725000 

0.1309280 

0.11033600 

16 

-1.102270 

0.1554920 

.1548 

0.155273000 

0.1576530 

0.13517200 

17 

-0.979796 

0.1864470 

.1855 

0.186178000 

0.1878990 

0.16359300 

18 

-0.857321 

0.2206560 

.2198 

0.220347000 

0.2215380 

0.19563400 

19 

-0.734847 

0.2579250 

.2567 

0.257596000 

0.2583700 

0.23121600 

20 

-0.612372 

0.2979740 

.2970 

0.297649000 

0.2981080 

0.27014600 

21 

-0.489898 

0.3404200 

.3393 

0.340122000 

0.3403610 

0.31210300 

22 

-0.367423 

0.3848270 

.3839 

0.384580000 

0.3846810 

0.35665200 

23 

-0.244949 

0.4306760 

.4296 

0.430501000 

0.4305300 

0.40324800 

24 

-0.122474 

.4765 

0.477318000 

0.45126200 

Where 

W = Mann-Whitney-WUcoxon  Test  Statistic 

= Standardized  Test-Statistic 

Sp_app  = Saddlepoint  Approximation  Based  on  Our  Approximation 

Exact  = Null  Distribution  of  Test-Statistic 

Sp  ex  = SaddlejX)int  Approximation  Based  on  Exact  C.G.F. 

Sp_K4  = Saddlepoint  Approximation  Based  on  Exact  Cumulants  (Kj-K^) 

Normal  = Normal  Approximation 
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number  of  comparisons  optimal. 

For  the  second  Saddlepoint  approximation,  we  used  an  approach  of  G.  S. 
Easton  and  E.  Ronchetti  (1986)  where  they  proposed  to  approximate  the  first  four 
cumulants  of  the  statistic  and  in  turn  used  these  to  further  approximate  the  cumulant 
generating  function  used  in  the  saddlepoint  approximation.  Here,  mainly  to  optimize 
their  approximation,  we  used  the  exact  (first  four)  cumulants  of  the  Mann-Whitney- 
Wilcoxon  statistic  as  given  by  Sundaram  (1954)  to  approximate  the  cumulant 
generating  function  used  in  the  saddlepoint  approximation. 

Comparisons  of  these  approximations  with  the  exact  have  been  tabulated  in 
Table  4.1  and  plotted  in  Figures  4.2(A-C).  From  Table  4.1,  it  is  clear  that  the 
Saddlepoint  Approximation  works  quite  well  to  approximate  the  tail  probability 
under  the  null  distribution  for  the  Mann-Whitney-Wilcoxon  test  statistic.  In  fact  it 
is  quite  close  (up  to  the  fourth  decimal  place)  to  the  exact  if  cumulant  generating 
function  calculated  by  null  distribution  itself  is  used,  see  Table  4.1.  While 
saddlepoint  approximation  based  on  our  approximation  of  the  cumulant  generating 
function  also  works  quite  well.  It  not  only  beats  the  normal  (large  sample) 
approximation  over  almost  the  whole  range,  but  it  also  beats  the  saddlepoint 
approximation  as  proposed  by  Easton  and  Ronchetti  (1986)  in  JASA  where  the 
cumulant  generating  function  has  been  approximated  by  using  the  first  four 
cumulants  (exact). 

For  an  extreme  test  statistic  value,  when  the  tail  probability  becomes  very 
small,  i.e.,  the  standardized  test  statistic  becomes  quite  large,  the  saddlepoint  also 
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Comparison  of  Tail  Probability  Approximation 
Using  Exact,  Saddle-Point  and  Normal  Distribution 
Figure  4.2  (A) 
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Figure  4.2  (B) 
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becomes  relatively  larger  and  then  the  saddlepoint  approximation  (based  on  the 
cumulant  generating  function  based  on  Hoeffding’s  decomposition)  tends  to  lose 
itsaccuracy.  The  reason  for  this  is  that  for  large  values  of  t,  cumulant  generating 
function  approximated  by  the  decomposition  theorem  may  not  be  too  accurate  and 
hence  the  saddlepoint  itself  may  be  less  accurate.  If  this  becomes  a problem,  we 
suggest  that  some  of  the  higher  order  terms  may  again  need  to  be  considered.  It  may 
be  worthwhile  to  note  that  this  is  not  a concern  for  saddlepoint  approximation  based 
on  cumulant  generating  function  evaluated  by  using  the  null  distribution,  which  beats 
the  normal  approximation  over  the  entire  range. 


CHAPTER  5 

COMPARISON  OF  THE  TAIL  PROBABILITY  APPROXIMATION 

BY  THE  METHOD  OF 

EDGEWORTH  EXPANSION  AND  SADDLEPOINT  APPROXIMATION 


Let 


a(  X ) = — e ; - «><  x < ~ 


and  D = — . 

dX 


Now,  consider  successive  derivatives  of  a(  X ) with  respect  to  X.  We  have 

Da(X)  = -X-a(X) 

D^a(X)  = (X^-1)  • a(X) 

D^a(X)  = (3X-X^)-a(X) 

D^a(X)  = (X^-6X^  + 3)‘a(X) 


and  so  on. 

Edgeworth  (1904)  derived  a series 
for  a standardized  case  (when  kj  = 0 and 

D* 

/(X)  = expl-L—  + L—  - ....} 
^3!  ^4! 


to  approximate  a density  function,  which 
k2  = 1)  reduces  to, 

•a(X). 
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Edgeworth  expansion,  even  though  it  has  been  used  to  a large  extent  both  in 
past  and  present  for  approximation  (of  density  or  distribution),  still  has  problems 
associated  with  it.  These  have  been  discussed  in  several  books  as  well  as  papers. 
Statistically  speaking,  it  is  not  as  important  to  use  a series  to  represent  a density 
function  as  it  is  to  make  sure  that  a finite  number  of  terms  can  result  in  a 
satisfactory  approximation.  It  is  important  here  to  note  that  the  series  used  in 
Edgeworth  expansion 

• may  give  negative  frequencies,  particularly  near  the  tails,  when  a finite 
number  of  terms  are  used  (Kendal,  Stuart  and  Ord  (1987)  Vol.  1). 

• may  behave  irregularly  in  the  sense  that  the  sum  of  k terms  may  result  in  a 
worse  fit  than  the  sum  of  (k  - 1)  terms. 

For  many  statistical  purposes,  we  are  most  interested  in  the  tails  of 
distribution,  and  it  is  here  that  shortcomings  of  the  series  (Edgeworth  expansion) 
approach  appear. 

P.  J.  Bickel,  F.  Gotze  and  W.  R.  Van  Zwet  (1986)  under  some  general  and 
mild  assumptions  (similar  to  the  ones  assumed  in  this  dissertation)  gave  an 
Edgeworth  expansion  for  U-statistics  (one  sample)  of  degree  two.  Their  results  also 
assume  (without  loss  of  generahty)  a standardized  U-Statistic  based  on  a symmetric 
kernel  among  some  other  technical  assumptions. 

To  be  specific,  if 

£ /i(  Xj  , X2  ) = 0 ; Eh\X^  ,X^)  < «>, 


and  define 
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g(  Xj ) = £ ( /i(  X, , Xj ) I X, ) , iK  X,y ) = /I(  X,y ) - g(  X ) - g(  y ) 
and  also, 

N N-1  N 

g(.x,)  , iKx,  ,X() 

i«l  i»l  y=/+l 


t'N  = E E M X,  , X;  ) = . 

i=l  y=i+l 


(5.1) 


Furthermore 

if  ol  = Var(  U^)  ; ol  = EgHX^)>0 

and  Let, 

k,  = o;’  ( £ g’(  X,  ) * 3 E g(  X,  ) «(  X,  ) Ili(  X„X,  ) I 

= o;'  i £ g*(  X^  ) - 3o‘  ■.  12  £ gH  Xi  ) g(  X,  ) i|r(  Xj,X,  ) 

* 12  £ g(  X,  ) g(  X,  ) iK  X„Xj  ) t(  Xj,X,  ) I (5.2) 

then  finally  under  the  assumption 


Eh^X^X^)  < oo 


they  show  that  ^3  ^ and  ^4  ^ are  asymptotic  expressions  with  error 
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o(.  N ^ ) for  the  third  and  fourth  cumulants  of  • their  Edgeworth 


expansion  under  this  setup  is  given  by 


X ) = $(  X ) - <J)(  X ) 


^ (x2  - 1)  + . N-^  (x^  - 3x) 

6 24 


+ — ' N-^  (x^  - lOx^  + 15x) 
72 


(5.3) 


and  the  related  order  of  approximation  is  o(  N~^  ) 


We  further  replaced  K ^ ^ and  K ' ^ by  the  exact  third  and  fourth 


“ 1 » T 

cumulants  of  in  (5.3)  to  optimize  the  accuracy  of  their  expansion  and  hence 


obtain  the  Edgeworth  expansion  as  follows: 


^//(  ^ ) = ^(  ^ ) - 4>(  a:  ) 


■l(x^  - D*  - 3x) 

6 24 


+ — (x’  - lOx^  + 15x) 
72 


(5.4) 


It  should  also  be  noted  here  that  in  our  examples,  the  equations  (5.3)  and 
(5.4)  for  the  two  Edgeworth  expansions  gave  the  same  approximation  to  the  tail 
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probability  upto  the  third  decimal  place,  i.e.,  see  Table  (5.1).  Then  equation  (5.4) 
was  used  to  approximate  the  tail  probability  for  comparison  with  our  saddlepoint 
approximation  of  tail  probability  where  the  cumulant  generating  function  was  in  turn 
approximated  by  using  Hoeffding’s  decomposition,  i.e.  equation  (3.10)  in  Chapter  3. 
We  carried  out  this  comparison  for  one  sample  Wilcoxon  Signed  Rank  test  statistic 
based  on  a sample  of  size  17  under  the  null  hypothesis.  Note  that  under  the  null 
hypothesis  (since  the  standardized  test  statistic  is  symmetric  about  zero)  for  this  test 
k3  = 0. 

Saddlepoint  based  approximation  did  much  better  over  the  whole  range  of 
values  associated  with  the  test  statistic  as  compared  to  both  Bickel,  Gotze  and  Van- 
Zwet’s  Edgeworth  expansion  of  a one  sample  U-statistic  of  degree  2,  i.e.  equation 
(5.3)  and  Edgeworth  expansion  based  on  exact  cumulants,  i.e.  equation  (5.4),  see 
Figures  5.1(A-C). 
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Table  5.1.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  between  Exact  (Null)  Distribution,  Normal  Approximation, 
Edgeworth  Expansion  and  Saddlepoint  Approximation 


w* 

^ ad 

Sp_app 

Exact 

Edge_ex 

Edge_Bk 

Normal 

35 

-1.964530 

0.0262973 

0.0253 

0.0239 

0.0239 

0.0247 

40 

-1.727840 

0.0449721 

0.0443 

0.0420 

0.0420 

0.0420 

45 

-1.491150 

0.0731930 

0.0727 

0.0693 

0.0692 

0.0680 

50 

-1.254460 

0.1127630 

0.1123 

0.1076 

0.1075 

0.1048 

55 

-1.017770 

0.1648830 

0.1645 

0.1585 

0.1583 

0.1544 

60 

-0.781079 

0.2296940 

0.2293 

0.2221 

0.2219 

0.2174 

65 

-0.544388 

0.3060150 

0.3056 

0.2974 

0.2973 

0.2931 

70 

-0.307698 

0.3913040 

0.3910 

0.3821 

0.3820 

0.3792 

75 

-0.071007 

0.4818850 

0.4816 

0.4724 

0.4724 

0.4717 

where 

W* 

” std 

Sp_app 

Exact 

Edge_ex 

Edge_Bk 

Normal 


= Wilcoxon  Signed  Rank  Test  Statistic 
= Standardized  Test-Statistic 

= Saddlepoint  Approximation  Based  on  Our  Approximation 
= Null  Distribution  of  Test-Statistic 
= Edgeworth  Expansion  Based  on  Exact  Cumulants  (K,-K4) 
= Edgeworth  Expansion  (Bickel,Gotze  and  Van  Zwet) 

= Normal  Approximation 


94 


XjnjqBqoid  [iBx 


Test  Statistic  (Standardized) 

Wilcoxon  Signed  Rank  Test 
Comparison  of  Tail  Probability  Approximation 
Using  Exact,  Saddlepoint(approx.  C.G.F.),  Normal  and  Edge-worth  Expansion 

Figure  5.1  (A) 
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Wilcoxon  Signed  Rank  Test 
Comparison  of  Tail  Probability  Approximation 
Using  Exact,  Saddlepoint(approx.  C.G.F.),  Normal  and  Edge-worth  Expansion 

Figure  5.1  (B) 
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Test  Statistic  (Standardized) 

Wilcoxon  Sijmed  Rank  Test 
Comparison  of  Tail  Probability  Approximation 
Using  Exact,  Saddlepoint(approx.  C.G.F.),  Normal  and  Edge-worth  Expansion 

Figure  5.1  (C) 


CHAPTER  6 

POWER  OF  ONE  SAMPLE  U-STATISTICS 


The  computation  of  the  exact  power  can  be  quite  difficult  for  the  one  sample 
U-statistics  even  for  moderate  sample  sizes.  One  of  the  reasons  for  this  is  that  the 
exact  distribution  of  the  U-statistics  depends  on  the  distribution  of  the  ranks  which 
may  not  be  easy  to  handle.  In  view  of  the  asymptotic  normahty  of  the  U-statistics, 
it  is  reasonable  to  use  normal  approximation  to  estimate  it’s  power.  It  needs  to  be 
noted  here  that  the  normal  approximation  may  in  fact  work  quite  well  as  long  as: 

• the  sample  size  is  large  enough; 

• under  the  alternative  hypothesis,  the  location  shift  parameter(A)  is  not  large; 

• the  underlying  population  distribution  is  not  a heavy  tailed  distribution. 

In  the  absence  of  any  of  the  above,  the  normal  approximation  may  not  be  found  to 
be  as  accurate  as  desired.  In  view  of  these  problems  associated  with  the  large 
sample  approximation,  we  propose  that  the  saddlepoint  based  our  tail  probability 
approximation  may  provide  a viable  alternative  to  normal  approximation. 

6.1  Discussion  and  Methods 

Observe  that  the  tail  probability  approximation  using  the  saddlepoint 
methodology  derived  under  a minimal  set  of  assumptions,  in  Chapter  3 is  a general 


97 


result  for  a one  sample  U-statistic,  based  on  a kernel  of  degree  K.  We  also  assume 
that  the  underlying  distribution  (population)  is  a continuous  distribution  function. 
Thus,  to  use  equation  (2.8)  in  Chapter  2 to  approximate  power  under  an  alternative 
requires  us  first  to  approximate  the  cumulant  generating  function  of  the  standardized 
U-statistic  (with  a kernel  of  degree  K)  by  equation  (3.20)  in  Chapter  3.  To 
approximate  the  cumulant  generating  function,  we  further  need  to  approximate  a few 
expectations  under  alternative  hypothesis.  Standard  Numerical  Integration 
techniques  can  be  used  to  do  this.  Note,  since  these  expectations  are  based  on 
functions  of  kernel  of  degree  K such  as, 

g,(  Xj  ) = EK  X„  ...  I X,  = I, ) 

*2<  y = B h(  X, X,  I X,  - X,  , X,  - Xj  ) (6.1) 

and, 

8$(.  ^ — E h(^  Xj,  ...  I Xj  — , X.2  — ^2  , Xj  — Xj  ) . 

i.e.,  some  conditional  expectations  of  the  kernel.  Note  that  if  we  are  interested  only 
in  approximating  the  power  of  a U-statistic,  then  since  these  conditional  expectations 
of  the  kernel  would  be  independent  of  the  sample  size  if  the  kernel  itself  is 
independent  of  N (sample  size),  i.e.,  we  would  not  have  to  re-evaluate  these  for 
different  sample  sizes  for  a specific  one  sample  U-statistics. 

Occasionally,  we  will  come  across  examples  of  non-parametric  test  statistics 
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where  the  kernel  may  be  dependent  on  the  sample  size  simply  because  the  test 
statistic  is  a linear  combination  of  U-statistics  (whose  tail  probability  can  also  be 
handled  by  the  same  methodology).  For  these  we  would  find  a dependency  of  the 
above  conditional  expectations  (6.1)  on  the  sample  size.  One  such  example  is  the 
Wilcoxon  Signed  Rank  test  (as  discussed  in  Chapter  3). 

In  approximating  these  expectations,  care  must  be  taken  not  to  sacrifice  the 
accuracy  simply  because  these  expectations  are  usually  quite  small,  i.e.,  or  order 
between  10  ^ and  10 

To  calculate  power  under  different  shift  alternatives,  we  decided  to  again  use 
one  sample  Wilcoxon  Signed  Rank  (which  is  a linear  combination  of  two  U-statistics) 
as  an  example  because  of  two  reasons: 

1.  It  is  a linear  combination  of  two  U-statistics. 

2.  The  form  for  gi(Xi)  and  ^(Xj,X2)  has  already  been  specified  in  Chapter  3. 
Note  that  the  tail  probability  under  null  hypothesis  (p-value)  is  independent 

of  the  underlying  distribution  (as  long  as  it  is  continuous  and  symmetric  about  0) 
since  the  U-statistics  is  distribution  free.  Thus,  these  expectations  should  be  the 
same  under  all  the  continuous  distributions  which  are  symmetric  about  zero.  For 
one  sample  Wilcoxon  Signed  Rank  test,  we  only  had  to  evaluate  4 different 
expectations  which  were  as  follows: 

(i)  Ej  = E g^{x)  ^j(  y ) t2(J^,y)  , {say) 

(ii)  = E gfi  X)  g^iy)  x,y  ) 
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(iii)  £3  = E g^ix)  gj(  y ) vjfjC  ) t2( 

(iv)  E,  = Ey\fl(x,y) 
where 

g,  ( X ) = ( 1 - F(  -X  ) - f (I  - F(  -X))dF(,X)] 
and 

1|>J  (X,  y)  - 1,0 .,  ( X . y)  * F(  -X)  * f ( -y) 

* / ( 1 - F ( -X  ) ) dF(  X ) 

Exact  power  for  Wilcoxon  Signed  Rank  test  is  not  easy  to  calculate.  It  has 
been  tabulated  by  Kotz  (1963)  for  normal  shift  alternatives  and  by  Arnold  (1965)  for 
shifted  t-distributions  with  1/2,  1,  2,  and  4 degrees  of  freedom  for  sample  sizes 
between  5 and  10.  Usually  a large  sample  approximation  (normal)  is  used  to 
calculate  power  which  also  may  be  inaccurate,  as  discussed  at  the  beginning  of  this 
chapter.  There  is  also  a discussion  in  Lehman’s  (1975)  book  "Nonparametrics"  about 
these  issues. 

If  a test  is  expressed  in  terms  of  critical  region  > c,  its  power  against  any 
fixed  alternative 


Hy  F is  symmetric  about  ^ * 0. 
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is  given  by 


n,  = p oy.  ^ c ) = p 


( (»;))'«  ( Var^  ( IP, ) ) '« 


(6.3) 

where  if 

P,=P^(z>0)  , P,  = P^iz,^  z^>0) 

and 

P3  = ( Zj  + Zj  ^ 0 fl/wi  Zj  + Z3  > 0 ) 

then 

^ ^ ^ ~ ^ ^ ■ P^  + N P^  (6.4) 

and 

( IT,  ) = iV  ( - 1 ) ( - 2 ) ( P3  - P2"  ) + 

^ ^ [ 2(  Pi  - P2  )2  OP2  ( 1 - P2  )]  ^ (6.5) 


N P,  ( 1 - Pj  ) 


Note  that  even  this  normal  approximation  discussed  above  requires  the 
computation  of  Pj,  P2,  and  Pj.  There  is  another  asymptotic  approximation  of  power 
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discussed  by  Lehman  (1975),  but  it  only  works  well  for  very  small  (delta’s)  under 
shift  alternatives  and  large  enough  sample  sizes.  According  to  Lehman  this  may  in 
fact  give  completely  misleading  results  in  the  Cauchy  alternatives.  Even  the  normal 
approximation  (eq.  6.3)  may  not  be  accurate  enough  sometimes,  especially  when 
there  is  a heavier  tailed  distribution,  e.g.,  Cauchy,  the  sample  size  is  not  large  enough 
or  delta,  the  location  shift  parameter  is  not  small.  As  a result  this  approximation 
may  only  work  better  if  the  underlying  distribution  is  either  normal  or  quite  similar 
to  normal  distribution,  delta  is  small  and  the  sample  size  is  large  enough. 

Computation  of  power  of  Wilcoxon  Signed  Rank  test  for  comparison  between 
Saddlepoint  method  using  the  approximated  cumulant  generating  function,  normal 
approximation  (equation  6.3)  and  exact  was  performed  under  normal,  logistic  and 
Cauchy  alternatives.  These  three  distributions  were  chosen  simply  to  illustrate  how 
well  the  tail  probability  approximation  based  on  the  saddlepoint  method  using  the 
approximated  cumulant  generating  function  can  handle  the  tail’s  getting  thicker,  i.e., 
normal  density  goes  to  zero  at  a very  fast  rate  of  exp(-x^)  while  the  Cauchy  density 
goes  to  zero  at  a relatively  much  slower  rate  of  x On  the  other  hand  for  the  logistic 
distribution,  the  density  goes  to  zero  at  a rate  of  e * which  is  between  that  of  normal 
and  Cauchy.  Since  only  shift  alternatives  were  going  to  be  considered,thus  for 
consistency  between  the  comparisons  for  standard,  normal,  logistic  and  Cauchy 
distributions  (all  symmetric  about  zero),  alternatives  were  symmetric  about  their  60, 
70  and  80  percentiles  in  each  of  the  three  cases.  As  a result,  nine  different 
alternatives  were  taken  into  consideration,  three  for  each  distribution. 
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The  expectations  needed  to  approximate  the  cumulant  generating  function 
further  needed  to  approximate  the  power  by  the  saddlepoint  method  have  been 
tabulated  in  Tables  6. 1-6.3  on  next  two  pages.  Note  that  there  are  only  four 
expectations  needed  to  successfully  use  our  saddlepoint  based  approximation  to 
approximate  the  tail  probability  for  the  Wilcoxon  Signed  Rank  test  under  different 
alternatives. 


6.2  Results 

We  used  the  Wilcoxon  Signed  Rank  (N=17)  test  (same  as  in  3'‘*  and  4^** 
chapters)  as  our  example  for  comparison  purposes.  As  per  our  discussion  in  the 
previous  section,  we  considered  location  shift  alternatives  under  normal,  logistic  and 
Cauchy  distributions.  Standardized  forms  of  these  distributions  (each  symmetric 
about  zero)  were  used  under  the  null  hypothesis  for  the  test  statistic.  While  for 
location  shift  alternatives,  we  used  60‘^  70“"  and  80“'  percentiles  of  these  standardized 
distributions  as  values  for  delta  (A),  the  location  shift  parameter. 

Nine  tables  (6.4  - 6.12)  have  been  produced  to  compare  the  exact  tail 
probabilities  under  the  various  location  shift  alternatives  for  each  of  the  three 
underlying  distributions  with  both  our  saddlepoint  based  tail  probability 
approximation  and  large  sample  (normal)  approximation.  We  plotted  the  left  tail 
probabilities  for  the  80“'  percentile  location  shift  alternatives  under  both  normal 
(Figure  6.1)  and  Cauchy  (Figure  6.2)  distributions.  This  was  done  simply  to  get  a 
flavor  for  these  results.  All  the  exact  tail  probabilities  were  calculated  by  10  million 
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Table  6.1.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Power  Calculation  of  Wilcoxon  Signed  Rank  Test  under 
Normal  Shift  Alternatives 


60  Percentile 

70  Percentile 

80  Percentile 

E. 

-0.00374194 

-0.00480049 

-0.00291832 

E, 

-0.00243942 

-0.000533953 

0.000534172 

E3 

0.00164375 

0.00125209 

0.00056999 

E4 

0.0801455 

0.0695841 

0.0497001 

Where 

E^  = E g^{x)  g,(  y ) ilfjC  x,y  ) 

E^  = E gli  x)  g^iy)  il/jf  x,y  ) 

£3  = £ 5i(  X ) gj(  y ) ilTjC  ) ijfjf  ) 

E^  = E tjC  x,y  ) 
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Table  6.2.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Power  Calculation  of  Wilcoxon  Signed  Rank  Test  under 
Logistic  Shift  Alternatives 

60  Percentile 

70  Percentile 

80  Percentile 

E,  -0.00241817 

Ej  -0.00268175 

Ej  0.00158255 

E4  0.0797383 

-0.00331032 

-0.00106512 

0.0011335 

0.0685381 

-0.00220906 

-0.00000398 

0.0005165 

0.0489896 

Note:  El  - E4  are  defined  the  same  as  on  page  110. 

Table  6.3.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Power  Calculation  of  Wilcoxon  Signed  Rank  Test  under 
Cauchy  Shift  Alternatives 

60  Percentile 

70  Percentile 

80  Percentile 

E,  0.00189138 

Ej  -0.00346422 

Ej  0.00156578 

E4  0.07843870 

0.00301712 

-0.00331040 

0.00111967 

0.06405560 

0.00283385 

-0.00266631 

0.00055827 

0.04215510 

Note:  E,  - E4  are  defined  the  same  as  on  page  1 10. 
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simulations.  The  mean  and  variance  under  each  of  these  alternatives  were  calculated 
by  equations  6.4  and  6.5. 

It  should  be  noted  that  our  saddlepoint  based  tail  probability  approximation 
does  consistently  better  than  normal  approximation  over  each  of  the  nine  alternatives 
for  this  example.  Again  for  an  extreme  test  statistic  value,  when  the  test  statistic  is 
quite  large,  the  saddlepoint  is  also  relatively  larger  and  as  a result  our  saddlepoint 
approximation  (calculated  by  using  the  cumulant  generating  function  based  on 
Hoeffding’s  decomposition)  may  tend  to  lose  it’s  accuracy.  The  main  reason  for  this 
may  be  that  the  saddlepoint  itself  may  not  be  too  accurate  since  the  cumulant 
generating  function  approximated  by  Hoeffding’s  decomposition  is  not  accurate 
enough.  If  this  is  a problem  then  we  suggest  to  include  a few  more  higher  order 
terms  to  improve  its  accuracy. 
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Table  6.4.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Normal  Alternative(A= 0.253347)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


^ std 

Exact 

Spapp 

Normal 

Left  Tail 


33 

-3.1914 

0.0012 

0.0010 

0.0007 

38 

-2.9429 

0.0027 

0.0026 

0.0016 

43 

-2.6945 

0.0055 

0.0053 

0.0035 

48 

-2.4461 

0.0106 

0.0103 

0.0072 

53 

-2.1976 

0.0191 

0.0188 

0.0149 

58 

-1.9492 

0.0328 

0.0324 

0.0256 

63 

-1.7008 

0.0536 

0.0533 

0.0444 

68 

-1.4524 

0.0835 

0.0834 

0.0732 

73 

-1.2039 

0.1247 

0.1247 

0.1143 

78 

-0.9555 

0.1785 

0.1789 

0.1696 

83 

-0.7071 

0.2454 

0.2462 

0.2497 

88 

-0.4587 

0.3248 

0.3257 

0.3232 

93 

-0.2102 

0.4138 

0.4151 

0.4168 

Right  Tail 

98 

0.0382 

0.5099 

0.5093 

0.4848 

103 

0.2866 

0.4132 

0.4128 

0.3872 

108 

0.5351 

0.3194 

0.3194 

0.2963 

113 

0.7835 

0.2340 

0.2345 

0.2167 

118 

1.0319 

0.1611 

0.1624 

0.1510 

123 

1.2803 

0.1030 

0.1057 

0.1002 

128 

1.5288 

0.0603 

0.0676 

0.0632 

where 

” std 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.5.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Normal  Alternative(A= 0.524401)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


W^d 

Exact 

Sp_app 

Normal 

Left  Tail 

53 

-3.7081 

0.0008 

0.0008 

0.0001 

58 

-3.4172 

0.0018 

0.0018 

0.0002 

63 

-3.1263 

0.0036 

0.0037 

0.0008 

68 

-2.8354 

0.0069 

0.0065 

0.0023 

73 

-2.5445 

0.0127 

0.0123 

0.0054 

78 

-2.2536 

0.0226 

0.0222 

0.0120 

83 

-1.9627 

0.0383 

0.0382 

0.0248 

88 

-1.6718 

0.0626 

0.0628 

0.0472 

93 

-1.3809 

0.0971 

0.0990 

0.0826 

98 

-1.0900 

0.1479 

0.1495 

0.1378 

103 

-0.7991 

0.2141 

0.2162 

0.2120 

108 

-0.5082 

0.2974 

0.3000 

0.3057 

113 

-0.2173 

0.3966 

0.4000 

0.4141 

Right  Tail 

118 

0.0737 

0.5154 

0.5371 

0.4706 

123 

0.3646 

0.4002 

0.3981 

0.3577 

128 

0.6555 

0.2874 

0.2845 

0.2561 

133 

0.9464 

0.1868 

0.1820 

0.1719 

138 

1.2373 

0.1006 

0.0991 

0.1080 

143 

1.5282 

0.0497 

0.0437 

0.0632 

148 

1.8191 

0.0170 

0.0163 

0.0345 

where 

” ad 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.6.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Normal  Alternative(A=0.841621)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


w* 

w+ 

^ ad 

Exact 

Spapp 

Normal 

Left  Tail 

83 

-4.0576 

0.0013 

0.0008 

0 

88 

-3.6574 

0.0026 

0.0020 

0.0001 

93 

-3.2572 

0.0055 

0.0047 

0.0005 

98 

-2.8569 

0.0110 

0.0103 

0.0021 

103 

-2.4567 

0.0214 

0.0210 

0.0070 

108 

-2.0565 

0.0398 

0.0403 

0.0198 

113 

-1.6562 

0.0707 

0.0727 

0.0488 

118 

-1.2560 

0.1206 

0.1240 

0.1045 

123 

-0.8558 

0.1958 

0.2001 

0.1961 

128 

-0.4555 

0.3016 

0.3061 

0.3243 

133 

-0.0553 

0.4392 

0.4438 

0.4779 

Right  Tail 

138 

0.3449 

0.4336 

0.4224 

0.3650 

143 

0.7451 

0.2686 

0.2334 

0.2280 

148 

1.1454 

0.0986 

0.0969 

0.1260 

153 

1.5456 

0.0224 

0.0377 

0.0610 

where 

std 

Exact 

Sp_app 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 


Normal 


no 


Table  6.7.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Logistic  Alternative(A= 0.405465)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


Exact 

Sp_app 

Normal 

Left  Tail 

33 

-3.1173 

0.0014 

0.0014 

0.0009 

38 

-2.8706 

0.0030 

0.0030 

0.0020 

43 

-2.6239 

0.0063 

0.0061 

0.0043 

48 

-2.3773 

0.0120 

0.0117 

0.0087 

53 

-2.1306 

0.0216 

0.0211 

0.0166 

58 

-1.8839 

0.0368 

0.0363 

0.0298 

63 

-1.6373 

0.0596 

0.0591 

0.0508 

68 

-1.3906 

0.0923 

0.0918 

0.0822 

73 

-1.1439 

0.1366 

0.1363 

0.1263 

78 

-0.8973 

0.1937 

0.1937 

0.1848 

83 

-0.6506 

0.2637 

0.2640 

0.2577 

88 

-0.4039 

0.3453 

0.3464 

0.3531 

93 

-0.1572 

0.4358 

0.4445 

0.4375 

Right  Tail 

98 

0.0894 

0.4882 

0.4644 

103 

0.3361 

0.3925 

0.3929 

0.3684 

108 

0.5828 

0.3013 

0.3015 

0.2800 

113 

0.8294 

0.2192 

0.2201 

0.2034 

118 

1.0761 

0.1499 

0.1517 

0.1409 

123 

1.3228 

0.0953 

0.0980 

0.0930 

128 

1.5694 

0.0556 

0.0644 

0.0583 

where 

W* 

” ad 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.8.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Logistic  Alternative(A= 0.847298)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


^ std 

Exact 

Spapp 

Normal 

Left  Tail 


48 

-3.7809 

0.0005 

0.0004 

0.0001 

53 

-3.4987 

0.0011 

0.0010 

0.0002 

58 

-3.2164 

0.0024 

0.0022 

0.0006 

63 

-2.9342 

0.0048 

0.0045 

0.0017 

68 

-2.6519 

0.0092 

0.0088 

0.0040 

73 

-2.3697 

0.0169 

0.0163 

0.0089 

78 

-2.0874 

0.0294 

0.0288 

0.0184 

83 

-1.8052 

0.0490 

0.0485 

0.0365 

88 

-1.5229 

0.0785 

0.0780 

0.0639 

93 

-1.2407 

0.1201 

0.1202 

0.1074 

98 

-0.9584 

0.1768 

0.1773 

0.1689 

103 

-0.6762 

0.2501 

0.2507 

0.2495 

108 

-0.3939 

0.3382 

0.3401 

0.3468 

113 

-0.1117 

0.4398 

0.4624 

0.4555 

Right  Tail 
118 

0.1706 

0.4728 

0.4773 

0.4323 

123 

0.4528 

0.3619 

0.3614 

0.3253 

128 

0.7351 

0.2569 

0.2569 

0.2311 

133 

1.0173 

0.1659 

0.1670 

0.1545 

138 

1.2996 

0.0941 

0.0987 

0.0969 

where 

W* 

std 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 


112 


Table  6.9.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Logistic  Alternative( A =1.38629)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


w* 

«td 

Exact 

Sp_app 

Normal 

Left  Tail 

78 

-3.9934 

0.0011 

0.0008 

0.0000 

83 

-3.6226 

0.0023 

0.0019 

0.0001 

88 

-3.2518 

0.0047 

0.0041 

0.0006 

93 

-2.8811 

0.0094 

0.0086 

0.0020 

98 

-2.5103 

0.0181 

0.0172 

0.0060 

103 

-2.1395 

0.0333 

0.0327 

0.0162 

108 

-1.7688 

0.0591 

0.0588 

0.0385 

113 

-1.3980 

0.0999 

0.1005 

0.0811 

118 

-1.0272 

0.1612 

0.1630 

0.1522 

123 

-0.6565 

0.2478 

0.2506 

0.2558 

128 

-0.2857 

0.3604 

0.3658 

0.3875 

Right  Tail 

133 

0.0851 

0.5321 

- 

0.4661 

138 

0.4558 

0.3824 

0.3782 

0.3243 

143 

0.8266 

0.2379 

0.2218 

0.2042 

148 

1.1974 

0.1138 

0.0897 

0.1156 

where 

W* 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.10.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Cauchy  Alternative  (A =0.32492)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


^ std 

Exact 

Spapp 

Normal 

Left  Tail 


38 

-2.5893 

0.0055 

0.0058 

0.0048 

43 

-2.3487 

0.0108 

0.0110 

0.0094 

48 

-2.1080 

0.0203 

0.0203 

0.0175 

53 

-1.8674 

0.0354 

0.0351 

0.0308 

58 

-1.6267 

0.0585 

0.0578 

0.0519 

63 

-1.3861 

0.0913 

0.0905 

0.0829 

68 

-1.1455 

0.1357 

0.1351 

0.1260 

73 

-0.9048 

0.1928 

0.1925 

0.1828 

78 

-0.6642 

0.2625 

0.2625 

0.2533 

83 

-0.4235 

0.3432 

0.3435 

0.3360 

88 

-0.1829 

0.4320 

0.4325 

0.4274 

Right  Tail 

93 

0.0578 

0.4938 

0.4938 

0.4770 

98 

0.2984 

0.4009 

0.4012 

0.3827 

103 

0.5390 

0.3124 

0.3132 

0.2949 

108 

0.7797 

0.2326 

0.2339 

0.2180 

113 

1.0203 

0.1645 

0.1663 

0.1538 

118 

1.2610 

0.1097 

0.1119 

0.1037 

123 

1.5016 

0.0683 

0.0712 

0.0666 

128 

1.7422 

0.0395 

0.0414 

0.0407 

where 

W* 

W* 

" ad 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.11.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Cauchy  Alternative  (A=0.726543)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


Exact  Sp_app  Normal 


Left  Tail 


53 

-2.7499 

0.0048 

0.0050 

0.0030 

58 

-2.4958 

0.0093 

0.0095 

0.0063 

63 

-2.2416 

0.0173 

0.0172 

0.0125 

68 

-1.9875 

0.0305 

0.0299 

0.0238 

73 

-1.7333 

0.0508 

0.0497 

0.0415 

78 

-1.4792 

0.0806 

0.0790 

0.0695 

83 

-1.2250 

0.1220 

0.1200 

0.1103 

88 

-0.9709 

0.1762 

0.1745 

0.1658 

93 

-0.7167 

0.2441 

0.2430 

0.2368 

98 

-0.4626 

0.3246 

0.3244 

0.3218 

103 

-0.2084 

0.4147 

0.4159 

0.4175 

Right  Tail 

108 

0.0457 

0.5073 

0.5072 

0.4818 

113 

0.2999 

0.4092 

0.4096 

0.3821 

118 

0.5541 

0.3155 

0.3161 

0.2898 

123 

0.8022 

0.2272 

0.2319 

0.2095 

128 

1.0624 

0.1548 

0.1612 

0.1440 

133 

1.3165 

0.0982 

0.1093 

0.0940 

where 

W* 

Exact 

Sp_app 

Normal 


Wilcoxon  Signed  Rank  Test  Statistic 
Standardized  Test-Statistic 

Exact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepxjint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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Table  6.12.  Comparison  of  Tail  Probability  Approximation  of  Wilcoxon  Signed  Rank 
Test  Statistic  (N=17)  under  Cauchy  Alternative  (A=  1.37638)  between  Exact 
Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


W*  Exact  Sp_app  Normal 


Left  Tail 


78 

-2.5280 

0.0117 

0.0116 

0.0057 

83 

-2.2433 

0.0216 

0.0209 

0.0124 

88 

-1.9587 

0.0370 

0.0362 

0.0251 

93 

-1.6740 

0.0622 

0.0599 

0.0471 

98 

-1.3893 

0.0978 

0.0952 

0.0824 

103 

-1.1046 

0.1457 

0.1448 

0.1347 

108 

-0.8199 

0.2154 

0.2108 

0.2061 

113 

-0.5352 

0.2932 

0.2935 

0.2963 

118 

-0.2505 

0.3845 

0.3905 

0.4011 

Right  Tail 

123 

0.0342 

0.5192 

0.5283 

0.4864 

128 

0.3189 

0.4165 

0.4176 

0.3749 

133 

0.6035 

0.3191 

0.3146 

0.2731 

138 

0.8882 

0.1955 

0.2305 

0.1872 

where 

= Wilcoxon  Signed  Rank  Test  Statistic 
= Standardized  Test-Statistic 
Exact  = 

Sp_app  = 


Normal 


Eixact(Simulated)  Distribution  of  T.Stat  under  the  Alternative 
Saddlepoint  Approximation  Based  on  Our  Approximation 
Normal  Approximation 
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CHAPTER  7 

POWER:  TWO  SAMPLE  U-STATISTICS 


The  computation  of  exact  power  may  be  quite  tedious  and  difficult,  simply 
because  the  distribution  of  ranks  under  that  alternative  may  not  always  be  easy  to 
handle.  As  a result,  computation  for  exact  power  is  usually  not  a feasible  approach 
to  estimation  of  power.  Simulation  is  a possible  alternative;  however,  it  is  sensitive 
to  random  number  generators.  A consequence  of  random  fluctuations  may  cause 
strange  results  for  tail  probability  approximation  under  the  alternative. 

On  the  other  hand,  similar  to  one  sample  situation,  normal  approximation  is 
possibly  another  alternative  available  for  all  two  sample  U-statistics.  Normal 
approximation  works  quite  well  as  long  as  the  sample  size  is  large  enough,  underlying 
distribution  is  not  a heavy  tailed  distribution  and  location  change  is  not  too  large 
under  the  alternative.  In  the  absence  of  any  of  the  above  three  conditions,  normal 
approximation  may  not  perform  very  well.  In  light  of  these  problems  associated  with 
the  large  sample  approximation,  we  suggest  that  our  saddlepoint  approximation  may 
give  a reasonable  alternative  for  the  tail  probability  approximation  for  a two  sample 
U-statistics  under  various  alternatives. 

7.1  Discussion  and  Methods 

Our  approach,  based  on  saddlepoint  approximation  in  approximating  the  tail 
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probability  as  discussed  in  Chapter  4,  is  again  quite  general  since  we  have  tried  to 
derive  the  tail  probability  approximation  for  the  two  sample  U-statistics  based  on  a 
kernel  of  degree  (Kj,K2)  under  very  minimal  assumptions  (including  a continuous 
distribution)  associated  with  the  underlying  distributions. 

Again,  to  approximate  power,  we  first  need  to  approximate  a few  expectations 
under  the  alternative  hypothesis.  Also  note  that  since  these  expectations  are  really 
conditional  expectations  of  the  kernel  (of  degree  Kj  and  K2),  they  may  be 
independent  of  the  sample  sizes  involved  if  the  kernel  is  independent  of  the  sample 
sizes.  If  these  expectations  are  independent  of  the  sample  sizes,  then  we  only  need 
to  approximate  them  once  under  a specific  distribution.  Once  they  have  been 
estimated,  we  can  use  them  for  various  power  calculations  of  the  U-statistic,  under 
the  same  alternative,  but  for  different  sample  sizes.  Their  structure  is  similar  to  (6.1) 
in  the  previous  chapter. 

To  illustrate  power  computations  for  two  sample,  again  we  used  Mann- 
Whitney-Wilcoxon  test  statistic.  The  form  for  these  expectations  has  already  been 
specified  in  Chapter  4 (two  sample  U-statistics).  Again,  if  we  are  interested  in 
testing  if  there  is  no  difference  between  the  underlying  population  distributions 
corresponding  to  the  two  samples,  then  the  null  distribution  is  simply  distribution 
free,  i.e.,  the  p-value  is  independent  of  the  underlying  distribution.  As  a result  the 
expectations  should  all  be  same  for  each  one  of  the  underlying  distributions 
(continuous). 

For  Mann-Whitney-Wilcoxon  Rank  Sum  test  we  had  to  evaluate  six  different 
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expectations  which  were  as  follows: 

(i)  £,  = £ g,  0 ( X ) go.i  ( y ) '1^1.1  (x,y) 

(ii)  E^  = E sIq  ( jc  ) go.i  ( y ) ti.i  ( y ) 

(iii)  £3  = Eg, j,ix)  gl,  ( y ) ( x,  y ) (7-1) 

(iv)  E,  = E til  (x,y) 

(v)  £3  = £ gjo  ( X ) gjo  ( x'  ) t,  j ( X,  y ) ti.i  (x',y) 

(vi)  £g  = £ goi  ( y ) gpj  ( y'  ) ti.i  ( Jc,  y ) ti.i  ( y'  )• 

Where,  under  the  location  shift  model  we  have, 

Xi  X„,  ~ F(X),  symmetric,  about  0. 

Yj Y„  ~ G(Y)  = F(Y  - A),  symmetric  about  A^O. 

it  p = £ r-x) 

then 


g,,,(X)  = 1 - G(X)  - p = (1  - p)  - G (X) 


(Y)  = F(Y)-p 


(7.2) 


and 


ti.i  ( X,  y ) = ljo„j  iY-X)^p-F(Y)^GiX)- 


The  computation  of  exact  distribution  (based  on  ranks)  for  two  sample  Mann- 
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Whitney  statistics  under  alternatives  to  hypothesis  F = G can  be  quite  tedious  and 
difficult  to  calculate  under  different  kinds  of  location  shift  alternatives.  Milton 


(1970)  provides  the  probabilities  (tail)  of  all  ( ) possible  rankings  of  X’s  and  Y’s 

for  1 ^ n ^ m ^ 7 for  normal  shift  alternatives  and  has  used  these  results  to  obtain 
exact  values  of  the  power  of  the  Wilcoxon  test  for  these  sample  sizes.  Bell,  Moser, 
and  Thompson  (1966)  give  the  power  of  the  Wilcoxon  test  for  rectangular  shift 
alternatives  while  Haynam  and  Govindarajulu  (1966)  do  so  for  exponential  shift 
alternatives  for  some  combinations  of  m and  n.  For  double  exponential  shift 
alternatives,  sample  sizes  m = n = 5,  and  a - 0.05,  Ramsey  (1971)  has  also  obtained 
power  for  this  test. 

Usually,  a normal  approximation  is  used  to  approximate  power  for  Mann- 
Whitney- Wilcoxon  test.  This  has  also  been  discussed  in  Lehmann’s  book  (1975)  on 
nonparametrics.  This  approximation  is  essentially  based  on  the  fact  that 


fvoFT^ 


- N (0,1) 


(7.3) 


as  m,n  tend  to  infinity  for  any  fixed  distributions  F and  G.  Here  W^y  is  Mann- 
Whitney  form  of  the  test  statistic.  For  this,  we  explicitly  need  to  calculate  mean  and 
variance  of  W,^y.  For  this,  we  further  need  to  estimate 

p^  = P(X<Y),  p2  = P(X<Yand  X < Y'  ) 


and 
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= P {X  < Y and  X'  < Y). 

Then 

E iW^)  = m • n • 

and 

Var  i W^)  = m n p^  { \ - p^)  * mn  n - \ ) i p^  - pI  ) + 

n m (m  - 1)(  p^  - pi  )• 

Also  observe  that  p2  = P3  in  the  shift  model  provided  the  distribution  is 
symmetric.  It  should  be  noted  that  this  normal  approximation  works  well  for  small 
values  of  A.  As  A becomes  large,  the  distribution  of  becomes  less  symmetric  and, 
therefore,  normal  approximation  less  accurate.  Lehmann  states  in  his  book  that  not 
much  is  known  about  non-normal  distributions.  But,  it  is  believed  that  for  these 
distributions,  this  approximation  may  be  accurate  for  very  large  sample  sizes. 

Again,  comparisons  were  carried  out  for  power  of  Mann-Whitney-Wilcoxon 
test  between  saddlepoint  method  using  approximated  cumulant  generating  function, 
normal  approximation  (equation  7.3)  and  exact  were  performed  under  normal, 
logistic,  and  Cauchy  shift  alternatives.  Similar  to  one  sample  case,  the  location  shift 
under  the  alternative  was  60,  70  and  80  percentiles  from  the  standardized 
distributions  in  each  of  the  three  (normal,  logistic  and  Cauchy)  cases.  Thus,  in  all 
nine  different  alternatives  were  taken  into  consideration. 

The  expectations  (Ej  ...  E^)  needed  to  approximate  the  cumulant  generating 
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Table  7.1.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Normal  Shift  Alternatives 


60  Percentile 

70  Percentile 

80  Percentile 

E, 

-0.00170874 

-0.00312077 

-0.00387317 

E^ 

-0.00255211 

-0.00191251 

-0.000987123 

E3 

-0.00255211 

-0.00191251 

-0.000987123 

E4 

0.082541 

0.0799171 

0.0744752 

E, 

0.00137143 

0.00130456 

0.00114007 

E. 

0.00137143 

0.00130456 

0.00114007 

Where 

= E ( X ) go  j ( y ) ti.i  ( X,  y ) 

E2  = E gfo  ( X ) go  j ( y ) ti  l ( y ) 

E3  = E gio  ( X ) goi  ( y ) ti.i  ( X,  y ) 

E^  = E til  ( X,  y ) 

E,  = E gio  ( X ) gio  ( x'  ) ti.i  ( Jc,  y ) ti.i  (x^,y) 

Ee  = E goi  ( y ) goi  ( y'  ) ti.i  ( Jc,  y ) ti.i  ( y'  )• 
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Table  7.2.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Logistic  Shift  Alternatives 


60  Percentile  70  Percentile  80  Percentile 


E,  -0.00108731 

Ej  -0.00261129 

Ej  -0.00261047 

E,  0.0824082 

E5  0.00135642 

Es  0.00135642 


-0.00202017 

-0.0021117 

-0.00211013 

0.0793982 

0.00124981 

0.00124981 


-0.00256961 

-0.00131622 

-0.00131369 

0.0731502 

0.00103606 

0.00103606 


Note:  Ej  - E^  are  defined  the  same  as  on  page  131. 


Table  7.3.  Computation  of  Expectations  Needed  to  Approximate  the  Cumulant 
Generating  Function  for  Cauchy  Shift  Alternatives 


60  Percentile  70  Percentile 


80  Percentile 


E,  0.000905932 

Ej  -0.00277992 

E3  -0.00277992 

E4  0.0820116 

E,  0.00135368 

Ej  0.00135368 


0.00183317 

-0.00277597 

-0.00277597 

0.0772355 

0.00122829 

0.001220829 


0.0026385 

-0.0026872 

-0.0026872 

0.0655041 

0.000940184 

0.000940184 


Note:  E,  - Ej  are  defined  the  same  as  on  page  131. 
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function  under  the  different  shift  alternatives,  have  been  tabulated  in  Tables  7.1  for 
normal,  12  for  logistic,  and  7.3  for  Cauchy. 

12  Results 

Wilcoxon  Rank  Sum  statistic  in  it’s  Mann-Whitney  form  (n=5,m=  10)  was  used 
as  an  example  for  comparison  purposes.  We  again  considered  location  shift 
alternatives  under  normal,  logistic  and  Cauchy  distributions.  Standardized  forms  of 
these  distributions  (each  symmetric  about  zero)  were  used  as  a underlying  population 
distribution  for  one  population  while  for  the  other,  a corresponding  location  shift 
alternative  was  used  as  the  population  distribution.  The  location  shift  parameter, 
delta  (A)  took,  60‘^  70“*  and  80“’  percentiles  of  the  corresponding  standardized 
distribution  as  it’s  values. 

Nine  tables  (7.4  - 7.12)  help  us  to  compare  the  exact  tail  probability  of  the 
Wilcoxn-Mann-Whitney  test  statistic  with  both,  the  proposed  saddlepoint  based  tail 
probability  approximation  and  the  large  sample  (normal)  approximation.  To  get  a 
better  feel  for  these  results  we  plotted  left  tail  probabilities  (see  Figures  7.1  and  7.2) 
given  in  Tables  7.8  (logistic  with  A = 0.847298)  and  7.11  (Cauchy  with  A = 0.726543) 
for  the  exact  and  the  two  approximations.  Again  we  used  10  million  simulations  to 
compute  the  exact  tail  probability  in  these  tables.  Equations  7.4  and  7.5  were  used 
to  calculate  the  mean  and  variance  under  each  of  the  nine  alternatives. 

Again,  our  proposed  saddlepoint  based  tail  probability  performed  consistently 
better  as  compared  to  normal  approximation  under  all  the  alternatives.  If  it  is  felt 
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Table  7.4.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m=10)  under  Normal  Alternative(A= 0.253347) 
between  Exact  Distribution,  Normal  approximation  and  Saddlepoint  Approximations. 


w 

Exact 

Sp_app 

Normal 

Left  Tail 

9 

-2.4249 

0.0098 

0.0088 

0.0077 

11 

-2.1769 

0.0190 

0.0178 

0.0147 

13 

-1.9289 

0.0343 

0.0330 

0.0269 

15 

-1.6808 

0.0577 

0.0567 

0.0464 

17 

-1.4328 

0.0913 

0.0910 

0.0760 

19 

-1.1848 

0.1368 

0.1377 

0.1180 

21 

-0.9368 

0.1956 

0.1974 

0.1744 

23 

-0.6888 

0.2672 

0.2697 

0.2455 

25 

-0.4408 

0.3500 

0.3528 

0.3297 

27 

-0.1928 

0.4410 

0.4435 

0.4236 

Right  Tail 

29 

0.0553 

0.5116 

0.5122 

0.4780 

31 

0.3033 

0.4162 

0.4175 

0.3808 

33 

0.5513 

0.3244 

0.3266 

0.2907 

35 

0.7993 

0.2406 

0.2436 

0.2121 

37 

1.0473 

0.1687 

0.1720 

0.1475 

39 

1.2953 

0.1107 

0.1146 

0.0976 

41 

1.5433 

0.0671 

0.0739 

0.0614 

where 

W 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.5.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m=10)  under  Normal  Alternative(A=0.524401) 
between  Exact  Distribution,  Normal  Approximation  and  Saddlepoint 
Approximations. 


w 

W.,d 

Exact 

Sp_app 

Normal 

Left  Tail 

9 

-3.0007 

0.0026 

0.0022 

0.0013 

11 

-2.7423 

0.0057 

0.0049 

0.0031 

13 

-2.4840 

0.0112 

0.0100 

0.0065 

15 

-2.2257 

0.0205 

0.0191 

0.0130 

17 

-1.9673 

0.0356 

0.0341 

0.0246 

19 

-1.7090 

0.0585 

0.0574 

0.0437 

21 

-1.4506 

0.0914 

0.0911 

0.0734 

23 

-1.1923 

0.1365 

0.1371 

0.1166 

25 

-0.9340 

0.1948 

0.1967 

0.1752 

27 

-0.6756 

0.2667 

0.2698 

0.2496 

29 

-0.4173 

0.3512 

0.3548 

0.3382 

31 

-0.1589 

0.4454 

0.4505 

0.4369 

Right  Tail 

33 

0.0994 

0.5049 

0.5134 

0.4604 

35 

0.3577 

0.4050 

0.4064 

0.3603 

37 

0.6161 

0.3086 

0.3111 

0.2689 

39 

0.8744 

0.2206 

0.2248 

0.1909 

41 

1.1328 

0.1462 

0.1526 

0.1287 

where 

W 

W,.d 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.6.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
WilcoxonTestStatistic(n=5,m= 10)  under  Normal  Alternative(A=0.841621)  between 
Exact  Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


w 

Exact 

Sp_app 

Normal 

Left  Tail 

11 

-3.53% 

0.0011 

0.0008 

0.0002 

13 

-3.2588 

0.0023 

0.0018 

0.0006 

15 

-2.9780 

0.0048 

0.0040 

0.0015 

17 

-2.6970 

0.0092 

0.0081 

0.0035 

19 

-2.4162 

0.0170 

0.0155 

0.0078 

21 

-2.1353 

0.0296 

0.0281 

0.0164 

23 

-1.8545 

0.0493 

0.0480 

0.0318 

25 

-1.5736 

0.0783 

0.0780 

0.0578 

27 

-1.2928 

0.1196 

0.1204 

0.0980 

29 

-1.0119 

0.1750 

0.1774 

0.1558 

31 

-0.7311 

0.2461 

0.2500 

0.2324 

33 

-0.4502 

0.3323 

0.3374 

0.3263 

35 

-0.1693 

0.4315 

0.4367 

0.4328 

Right  Tail 

37 

0.1115 

0.5149 

0.5131 

0.4556 

39 

0.3924 

0.4048 

0.4048 

0.3474 

41 

0.6732 

0.2970 

0.2989 

0.2504 

43 

0.9541 

0.1992 

0.2027 

0.1700 

45 

1.2349 

0.1195 

0.1240 

0.1084 

where 

W 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.7.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m  = 10)  under  Logistic  Alternative(A= 0.405465) 
between  Exact  Distribution,  Normal  Approximation  and  Saddlepoint 
Approximations. 


w 

W„a 

Exact 

Sp_app 

Normal 

Left  Tail 

7 

-2.6440 

0.0048 

0.0047 

0.0040 

9 

-2.3964 

0.0103 

0.0097 

0.0083 

11 

-2.1489 

0.0200 

0.0189 

0.0158 

13 

-1.9013 

0.0360 

0.0349 

0.0286 

15 

-1.6538 

0.0604 

0.0597 

0.0491 

17 

-1.4062 

0.0952 

0.0955 

0.0798 

19 

-1.1586 

0.1424 

0.1439 

0.1233 

21 

-0.9111 

0.2027 

0.2052 

0.1811 

23 

-0.6635 

0.2756 

0.2789 

0.2535 

25 

-0.4160 

0.3594 

0.3629 

0.3387 

27 

-0.1684 

0.4509 

0.4524 

0.4331 

Right  Tail 

29 

0.0792 

0.5016 

0.5113 

0.4780 

31 

0.3267 

0.4065 

0.4080 

0.3808 

33 

0.5743 

0.3156 

0.3180 

0.2907 

35 

0.8218 

0.2335 

0.2364 

0.2121 

37 

1.0694 

0.1630 

0.1664 

0.1475 

39 

1.3170 

0.1067 

0.1105 

0.0976 

41 

1.5645 

0.0644 

0.0709 

0.0614 

where 

W 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.8.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic(n=5,m=  10)  under  Logistic  Alternative(A= 0.847298)  between 
Exact  Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


W Exact  Sp_app  Normal 


Left  Tail 


9 

-2.9362 

0.0029 

0.0023 

0.0016 

11 

-2.6797 

0.0063 

0.0053 

0.0037 

13 

-2.4232 

0.0124 

0.0109 

0.0077 

15 

-2.1668 

0.0227 

0.0209 

0.0151 

17 

-1.9103 

0.0393 

0.0374 

0.0280 

19 

-1.6538 

0.0640 

0.0626 

0.0491 

21 

-1.3973 

0.0994 

0.0989 

0.0812 

23 

-1.1408 

0.1469 

0.1479 

0.1270 

25 

-0.8844 

0.2079 

0.2104 

0.1882 

27 

-0.6279 

0.2823 

0.2860 

0.2650 

29 

-0.3714 

0.3686 

0.3727 

0.3552 

Right  Tail 

33 

0.1416 

0.4869 

0.4885 

0.4437 

35 

0.3980 

0.3882 

0.3908 

0.3453 

37 

0.6545 

0.2941 

0.2988 

0.2564 

39 

0.9110 

0.2092 

0.2168 

0.1811 

41 

1.1675 

0.1379 

0.1515 

0.1215 

where 

W 

Exact 

Spapp 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.9.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m=  10)  under  Logistic  Alternative(A= 1.38629)  between 
Exact  Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


W W„j  Exact  Sp_app  Normal 


Left  Tail 


14 

-3.0234 

0.0039 

0.0032 

0.0012 

16 

-2.7467 

0.0077 

0.0066 

0.0030 

18 

-2.4701 

0.0145 

0.0128 

0.0067 

20 

-2.1935 

0.0257 

0.0237 

0.0141 

22 

-1.9169 

0.0434 

0.0413 

0.0276 

24 

-1.6403 

0.0698 

0.0683 

0.0505 

26 

-1.3637 

0.1075 

0.1073 

0.0863 

28 

-1.0871 

0.1589 

0.1603 

0.1385 

30 

-0.8105 

0.2254 

0.2284 

0.2088 

32 

-0.5339 

0.3069 

0.3113 

0.2967 

34 

-0.2573 

0.4015 

0.4069 

0.3985 

Right  Tail 

37 

0.1577 

0.4946 

0.4945 

0.4373 

39 

0.4343 

0.3865 

0.3889 

0.3320 

41 

0.7109 

0.2820 

0.2894 

0.2386 

43 

0.9875 

0.1891 

0.2054 

0.1617 

where 

W 

Exact 

Sp_app 

Normal 


Mann-Whitney-WUcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.10.  Comparison  of  Tail  Probability  Approximation  of  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m=  10)  under  Cauchy  Alternative(A= 0.32492)  between 
Exact  Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


w 

Exact 

Sp_app 

Normal 

Left  Tail 

8 

-2.4061 

0.0090 

- 

0.0081 

10 

-2.1601 

0.0182 

0.0221 

0.0154 

12 

-1.9141 

0.0336 

0.0336 

0.0278 

14 

-1.6682 

0.0571 

0.0513 

0.0476 

16 

-1.4222 

0.0915 

0.0837 

0.0775 

18 

-1.1762 

0.1381 

0.1395 

0.1198 

20 

-0.9302 

0.1982 

0.2003 

0.1761 

22 

-0.6842 

0.2707 

0.2736 

0.2469 

24 

-0.4382 

0.3539 

0.3571 

0.3306 

26 

-0.1923 

0.4450 

0.4474 

0.4238 

Right  Tail 

28 

0.0537 

0.5082 

0.4915 

0.4786 

30 

0.2997 

0.4139 

0.4158 

0.3822 

32 

0.5457 

0.3237 

0.3269 

0.2926 

34 

0.7917 

0.2417 

0.2460 

0.2143 

36 

1.0377 

0.1713 

0.1761 

0.1497 

38 

1.2836 

0.1144 

0.1111 

0.0996 

40 

1.5296 

0.0715 

0.0725 

0.0631 

where 

W 

W.,a 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.11.  Comparison  of  Tail  Probability  Approximation  for  Mann-Whitney- 
Wilcoxon  Test  Statistic  (n=5,m=10)  under  Cauchy  Alternative  (A  = 0.726543) 
between  Exact  Distribution,  Normal  Approximation  and  Saddlepoint 
Approximations. 


w 

w.„ 

Eixact 

Spapp 

Normal 

Left  Tail 

10 

-2.5676 

0.0070 

- 

0.0051 

12 

-2.3176 

0.0140 

0.0133 

0.0102 

14 

-2.0677 

0.0257 

0.0246 

0.0193 

16 

-1.8178 

0.0443 

0.0430 

0.0346 

18 

-1.5678 

0.0722 

0.0711 

0.0585 

20 

-1.3179 

0.1112 

0.1108 

0.0938 

22 

-1.0679 

0.1627 

0.1636 

0.1428 

24 

-0.8180 

0.2275 

0.2298 

0.2067 

26 

-0.5681 

0.3049 

0.3081 

0.2850 

28 

-03181 

0.3927 

0.3960 

0.3752 

30 

-0.0682 

0.4871 

0.4894 

0.4728 

Right  Tail 

32 

0.1817 

0.4655 

0.4659 

0.4279 

34 

0.4317 

0.3707 

0.3730 

0.3330 

36 

0.6816 

0.2816 

0.2862 

0.2477 

38 

0.9316 

0.2020 

0.2093 

0.1758 

40 

1.1815 

0.1365 

0.1461 

0.1187 

42 

1.4314 

0.0846 

_ 

0.0762 

where 

W 

Exact 

Sp_app 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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Table  7.12.  Comparison  of  Tail  Probability  Approximation  for  Wilcoxon-Mnn- 
Whitney  Test  Statistic  (n=5,m= 10)  under  Cauchy  Alternative  (A=  1.37638)  between 
Exact  Distribution,  Normal  Approximation  and  Saddlepoint  Approximations. 


w 

Exact 

Spapp 

Normal 

Left  Tail 

14 

-2.6889 

0.0069 

0.0063 

0.0036 

16 

-2.4278 

0.0133 

0.0123 

0.0076 

18 

-2.1666 

0.0241 

0.0225 

0.0151 

20 

-1.9055 

0.0412 

0.0390 

0.0284 

22 

-1.6443 

0.0665 

0.0642 

0.0501 

24 

-1.3832 

0.1027 

0.1007 

0.0833 

26 

-1.1220 

0.1514 

0.1505 

0.1309 

28 

-0.8609 

0.2147 

0.2146 

0.1946 

30 

-0.5997 

0.2912 

0.2924 

0.2743 

32 

-0.3386 

0.3785 

0.3817 

0.3675 

34 

-0.0775 

0.4739 

0.4784 

0.4691 

Right  Tail 

36 

0.1837 

0.4765 

0.4753 

0.4271 

38 

0.4448 

0.3745 

0.3785 

0.3282 

40 

0.7060 

0.2813 

0.2884 

0.2401 

42 

0.9671 

0.1937 

0.1900 

0.1667 

44 

1.2283 

0.1275 

- 

0.1097 

where 

W 

Exact 

Spapp 

Normal 


Mann-Whitney-Wilcoxon  Test  Statistic 
Standardized  Test  Statistic 

Exact(Simulated)  Distribution  of  T.Stat  Under  Alternative 
Saddlepoint  Approximation  Based  on  Our  Result 
Normal  Approximation 
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that  the  saddlepoint  approximation  to  the  tail  probability  is  not  as  acxjurate  as 
desired  (specially  in  the  tails),  then  it  may  be  worthwhile  to  consider  including  more 
higher  order  terms  in  the  expansion  for  the  cumulant  generating  function  as  given 
in  Chapter  4. 
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